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ABSTRACT 


This  thesis  studied  the  relationship  between  teacher 
leader  behavior  in  the  classroom  and  ratings  of  teacher 
effectiveness.  Leader  behavior  dimensions  of  Consideration 
and  Initiating  Structure  were  considered  the  crucial  vari¬ 
ables  within  a  theoretical  framework  of  variables  for  study¬ 
ing  teacher  effectiveness. 

In  12  schools  in  Alberta  59  teachers  in  grades  8  to 
12  were  the  centre  of  the  study.  They  were  rated  on  teaching 
effectiveness  and  their  leader  behavior  was  described  by  them¬ 
selves.,  by  571  students,  50  other  teachers  on  the  same  staff, 
12  principals,  and  12  superintendents.  The  Teacher  Leader 
Behavior  Description  Questionnaire  was  used  to  describe  the 
teacher's  leader  behavior  and  a  global  scale  was  used  to  give 
a  single  rating  of  his  teaching  effectiveness. 

The  study  supported  the  following  conclusions: 

1.  The  most  effective  teachers  were  those  who  were 
described  as  being  above  the  average  on  Consideration  and 
Initiating  Structure  behavior.  The  least  effective  teachers 
were  those  who  were  described  as  being  below  the  average  on 
both  dimensions  of  leader  behavior. 


2.  Students  related  effective  teaching  to  both 
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Consideration  and  Initiating  Structure  behavior.  There  was 
a  tendency  for  Consideration  behavior  to  be  more  closely 
related  to  ratings  of  effectiveness. 

3.  Administrators  related  effective  teaching  to 
both  Consideration  and  Initiating  Structure  behavior,  but 
neither  dimension  was  more  closely  related  to  ratings  of 
effectiveness . 

Summing  up,  one  of  the  factors  related  to  teacher 
effectiveness  in  the  classroom  was  the  extent  to  which  the 
teacher  exhibited  human  relations  behavior  and  behavior  which 
attempted  to  get  things  done  or  accomplish  goals.  This 
appeared  true  in  teaching  grades  eight  to  twelve,  in  both 
academic  and  non-academic  subjects,  and  in  a  wide  variety 
of  non-urban  schools. 
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CHAPTER  I 


THE  PROBLEM 

The  institution  of  education  is  based  on  the  teachers 
in  the  classrooms.  Improvement  of  the  effectiveness  of  the 
teacher  is  of  concern  to  teachers,  administrators,  students, 
parents,  trustees  and  the  general  public.  Studies  of  teach¬ 
ing  have  analyzed  many  aspects  of  the  work  of  the  teacher 
with  the  hope  that  some  insight  into  his  work  will  provide  a 
background  for  improvement  in  effectiveness.  This  thesis  is 
a  further  study  of  teacher  effectiveness. 

I.  THE  GENERAL  PROBLEM 

This  research  was  concerned  with  an  attempt  at  describ¬ 
ing  specific  teacher  behavior  and  the  relationship  of  this 
behavior  to  teacher  effectiveness.  This  attempt  originated 
from  empirical  evidence  in  the  field  of  leadership  which  was 
considered  applicable  to  the  field  of  teaching.  Evidence 
gathered  through  the  Ohio  State  University  Leadership  Studies 
initiated  in  19^5  has  provided  a  means  of  describing  leader 
behavior.  The  teacher  in  the  classroom  is  in  a  leadership 
situation  where  he  is  interested  in  goal  achievement  and  human 
relations.  Thus,  the  problem  studied  was  the  relationship 
between  descriptions  of  teacher  leader  behavior  and  judgments 


of  teacher  effectiveness. 


..  i .  C  )!  •!.  .  j  . .  \j  .  •-  • .. 

.  j  .  •  ’  ' 


.  0.1  .'. 

v  .L 

.  Jr.  * 

31303.  ><  .  •  t .  ;  :  .  '  ,  3  , 

•  '  *  ■ 

.3  . 

.  ;•  a  ...•  ... 

• 

•  v>4  'J 

© 

.  .  •  ti.  d  .  .  .  «IC  t  .  •  .  :  : 


.  j‘  V  JL  •  3>.  ... .  _L  ’  1  .•  •  d  3  ..  *  .L 


.  VI.  .  ..... 


.  •  •  .;  j:  •:  .3  ..  J  b  t  ,o\.  ...... 

. ! .  -  •  j  ...  \j  j .*  .. .  v.£  .U  a  .3  .  v  .  >  *  '  .l  O .'  .  . .  ...  \_i  t 

•  v  .iJo  v.  j-.  ■  . -1  n  \  ■.  c  *:•  >  .  ). 

.  '■  i  ■  .  ..',3...  Mail  J  j  •/.  v  -•  y.r.oi'S.  m  :> 

•  V  :  .  ;  „  ,>d  1  bf.  ..V.  .  •  v  •  ..  :.u. .  -  ...r.  .3  .  ..a  )b.  k 

■  '  •  ,  .  •'  ...  .  ,  .  .  ague* 

•  .  CUB  >lVGrI  Ul  ;  :  ■ 

I-  \  .  1  •:  ..  ■  \  ,  ■  ....  _  j:  .  .  .  .W  ...  J.  .  , 

:  -  . >  J  . <  .  i  . . .  ■  .j.  ,  . .  .  .......  ‘  j.  )r 

•W—  o  --  -  -  -v  3  .  ...31  .  ‘3;  a:l.;jJ  ’j. .  ,f..  ...J  1  ‘  .3  j 


.  •  : 


2 


II.  NEED  FOR  STUDY  OF  TEACHER  EFFECTIVENESS 


When  one  hears  the  saying,  nA  school  is  no  better  than 
its  teachers.”,  the  importance  of  the  teacher  is  well  illus¬ 
trated.  The  improvement  of  our  educational  system  is  depend¬ 
ent  on  the  improvement  of  our  training  of  teachers  both  in 
the  teacher  training  institutions  and  in  the  field.  This 
improvement  of  teachers  is  dependent  upon  better  selection  of 
candidates  to  eliminate  those  who  will  be  ineffective.  These 
attempts  at  improvement  are  based  on  the  assumption  that  we 
are  able  to  define  an  effective  teacher. 

That  we  are  not  yet  able  to  define  the  effective 
teacher  is  supported  by  the  following  statements.  The  Ency¬ 
clopedia  of  Eductional  Research  states, 

...  it  is  apparent  that  the  identification  and 
definition  of  teaching  competencies  is  as  yet  by  no 
means  satisfactory.  .  .  .  we  do  not  yet  have  an  ade¬ 
quate  definition  of  teaching  efficiency.  .  .  . 

(4,  p.  1453) 

Kinney  says. 

Clearly,  then,  whether  we  are  administrators, 
supervisors,  teacher  educators,  placement  officers, 
parents,  teachers,  students,  or  merely  one  among 
the  various  categories  of  critics  of  the  public 
schools,  each  of  us  has  his  definition  of  teach¬ 
ing  competency--fuzzy  or  clear,  implicit  or 
explicit,  narrow  or  comprehensive,  it  serves  as 
the  basis  for  significant  decisions.  Unless  we 
arrive  at  a  common  understanding,  the  result  must 
continue  as  confusion,  controversy,  and  inef¬ 
ficiency.  .  .  .  (3*  P*  l) 
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Ackerman  states,  ”...  there  is  hardly  any  conclusive  evi¬ 
dence  as  to  the  nature  and  means  of  identifying  teacher 
competence.”  (l,  p.  273) 

We  have  assumed  that  there  is  a  need  to  improve 
teacher  effectiveness.  An  attempt  at  improvement  first 
requires  an  evaluation  of  how  effective  teachers  are  at 
present.  But  without  a  definition  or  an  understanding  of 
teacher  effectiveness,  an  evaluation  cannot  take  place. 

Thus,  improvement  of  teaching  is  restricted. 

Without  an  understanding  of  what  makes  a  teacher 
effective,  we  have  difficulty  in  preparing  the  most  suitable 
teacher  training  program.  Ryans  believes  that  we  are  certain 
that  we  want  as  many  good  teachers  as  possible.  To  obtain 
these  teachers  we  have  created  systems  for  selecting  teachers, 
have  devised  programs  of  training,  and  set  up  systems  of 
supervision  to  improve  teaching.  These  all  have  been  based  on 
common  sense  and  practical  wisdom  with  little  scientific  foun¬ 
dation,  but  with  high  hopes  and  deep  conviction  that  we  are 
doing  the  best  (5*  p.  238).  The  need  for  a  study  of  teacher 
effectiveness  is  apparent  if  we  desire  to  improve  our  train¬ 
ing  of  teachers. 

Our  supervision  of  teachers  implies  that  the  super¬ 
visor  desires  to  improve  instruction,  but  without  some  under¬ 
standing  of  what  makes  an  effective  teacher  much  of  our 
supervisory  effort  must  be  wasted.  Burton  and  Brueckner 
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state,, 


.  .  .  All  in  all,  teaching  is  a  very  complex 
activity,  and  the  haphazard,  unscientific,  and 
superficial  study  of  teaching  that  characterizes 
much  of  our  supervision  today  should  not  be 
tolerated.  .  .  .  Just  as  we  have  developed 
improved  methods  of  studying  pupils  and  their 
habits  of  work,  so  we  must  develop  improved 
methods  of  studying  and  assisting  teachers. 

(2,  p.  359) 


III.  LIMITATION  OF  THE  STUDY 


The  attempt  to  solve  the  problem  of  teacher  effective¬ 
ness  has  been  characterized  by  two  lines  of  attack.  One  has 
been  the  traits  approach,  assuming  that  effectiveness  is 
dependent  on  the  individual  teachers.  The  other,  which  has 
received  less  emphasis,  has  been  the  situational  approach, 
assuming  effectiveness  is  dependent  on  the  situation  in  which 
the  teacher  is  placed. 

This  study  assumed  that  both  the  behavior  of  the 
individual  teacher  and  the  situation  in  which  he  is  placed 
are  related  to  effectiveness.  The  study  was  limited,  though, 
by  being  concerned  only  with  the  behavior  of  the  teachers  no 
matter  how  variable  were  the  situations  in  which  they  taught. 

The  study  was  not  concerned  with  character  traits  of 
teachers  since  research  on  evaluation  of  teacher  effective¬ 
ness  has  shown  this  approach  to  be  fruitless.  The  study  was 
concerned  with  teacher  leader  behavior  in  the  classroom. 
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No  attempt  was  made  to  relate  leader  behavior  to  the 
type  of  school  system,  subjects  or  grades  taught,  experience 
and  training  of  the  teachers,  type  of  community  or  any  of 
the  numerous  other,  situational  variables. 

Teacher  effectiveness  was  limited  to  the  actual 
teaching  of  students  in  the  classroom  and  was  defined  as 
the  opinions  of  five  sets  of  raters.  These  were  students, 
principals,  superintendents,  other  teachers  on  the  same 
staff,  and  the  teachers  themselves. 

IV.  SPECIFIC  PROBLEMS 

1.  The  major  problem  was  to  study  the  relationship 
between  descriptions  of  teacher  leader  behavior  and  ratings 
of  effectiveness  when  the  descriptions  of  leader  behavior 
and  ratings  of  effectiveness  were  provided  by  students, 
other  teachers,  principals,  superintendents  and  the 
teachers  themselves. 

2.  A  secondary  problem  was  concerned  with  the 
various  viewpoints  that  the  five  sets  of  raters  would  pro¬ 
vide  when  rating  the  teacher* s  effectiveness.  Thus  a  second 
problem  was  to  study  the  relationships  between  the  effect¬ 
iveness  ratings  provided  by  these  raters. 

3.  A  parallel  problem  to  the  second  one  was  to  study 
the  relationships  between  the  descriptions  of  the  teacher's 
leader  behavior  provided  by  these  five  sets  of  raters. 
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CHAPTER  II 


RELATED  LITERATURE 

This  chapter  reviews  some  of  the  literature  related 
to  previous  approaches  to  the  solution  of  the  problem  of 
teacher  effectiveness,  some  of  the  literature  related  to 
evaluation  of  teacher  effectiveness,  and  literature  con¬ 
cerning  the  processes  of  perceiving  and  judging  the  behavior 
and  effectiveness  of  individuals. 

I.  APPROACHES  TO  THE  STUDY  OF  TEACHER  EFFECTIVENESS 

Four  approaches  are  reported  in  this  section.  In 
the  first,  teacher  effectiveness  was  studied  through  the 
results  produced  in  the  pupils.  In  the  second  approach, 
teacher  effectiveness  was  studied  through  the  traits  which 
are  characteristic  of  good  and  poor  teachers.  The  third 
approach  was  to  study  teacher  performance  and  the  final 
approach  reported  was  the  study  of  teacher  behavior. 

Teacher  Effectiveness  Studied  through  the  Results  Produced 
in  Pupils 

The  extreme  example  of  this  type  of  approach  was 
concerned  with  changes  in  the  pupil  as  a  member  of  the 
community.  What  changes  in  the  members  of  the  community 
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(the  former  pupils)  were  brought  about  by  the  teacher?  What 
kind  of  teachers  or  teaching  brought  about  this  desirable 
change?  In  favour  of  this  approach  are  some  sociologists 
such  as  Payne  who  believed  we  must  use  devices  measuring 
total  behavior  changes  in  family  and  community  to  evaluate 
effectiveness  of  education  (12,  p.  6j0) .  He  states,  .  . 
results  of  education  should  be  measured  in  the  community 
rather  than  in  the  school.”  (12,  p.  671)  This  approach 
requires  agreement  firstly,  on  a  definition  of  a  desirable 
change  in  the  pupil,  and  secondly,  isolation  of  the  effects 
of  the  teacher  on  the  pupil  as  distinct  from  the  effects  of 
other  aspects  of  the  community.  Both  of  the  above  require¬ 
ments  appear  very  difficult  to  obtain. 

A  great  deal  has  been  reported  using  pupil  growth  and 
achievement  in  school  as  the  criterion  for  effective  and 
ineffective  teaching.  There  has  been  much  disagreement  over 
this  type  of  approach.  It  has  been  difficult  to  isolate  the 
effect  of  a  single  teacher  and  also  it  has  been  difficult  to 
control  the  many  pupil  variables.  Many  of  the  criteria  of 
successful  and  non-successful  pupils  have  been  in  the  form 
of  standardized  tests  and  although  these  may  indicate  some 
aspects  of  desirable  pupil  change,  they  do  not  indicate  all 
aspects.  Biber  believes  that  it  may  be  possible  through 
intensive  psychological  studies,  using  trained  personnel 
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and  by  controlling  such  factors  as  family  and  socio-economic 
background,  to  study  pupil  growth  as  a  measure  of  the  cumu¬ 
lative  effect  of  education,  but  not  as  an  index  of  an  indi¬ 
vidual  teacher’s  effectiveness  (3*  p.  215).  Reavis  and 
Cooper  summarize  the  research  using  this  approach: 

.  .  .  one  of  the  major  efforts  of  the  past 
twenty  years  in  teacher  evaluation  has  been  the 
attempt  to  isolate  the  influence  of  teaching 
ability  as  a  single  variable  affecting  pupil 
achievement.  Age,  intelligence,  initial  achieve¬ 
ment  and  socio-economic  status  of  pupils  have  been 
held  constant  in  various  significant  studies  of 
pupil  achievement.  Several  types  of  accomplish¬ 
ment  quotients  and  pupil  progress  indices  have 
resulted  from  these  studies. 

The  disappointing  feature  of  attempts  to 
relate  pupil  results  to  teaching  success  has 
been  that  pupil  results  have  not  correlated 
highly  with  other  measures  of  teacher  success. 

In  particular,  the  correlations  between  pupil 
results  and  administrator  or  supervisor  ratings 
have  been  low.  (13j  pp.  100,  101) 

Ackerman,  in  reviewing,  concludes, 

.  .  .  more  than  thirty  studies  were  reviewed. 

In  each  of  these  studies  an  attempt  was  made  to 
evaluate  teacher  competence  and  effectiveness  by 
the  use  of  the  criterion  of  pupil  change.  In 
general  the  results  are  contradictory  and  incon¬ 
sistent.  (l,  p.  284) 


Teacher  Effectiveness  Studied  through  the  Traits  of  Teachers 


The  assumption  behind  this  approach  has  been  that  if 
one  can  determine  those  traits  which  are  characteristic  of 
good  and  poor  teachers,  one  would  be  able  to  predict  which 
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teachers  would  be  most  effective  in  the  classroom,  school 
and  community. 

A  detailed  traits  approach  was  used  in  the  Common¬ 
wealth  Teacher  Training  Study  and  reported  by  Charters  and 
Waples  in  1929.  About  this  study,  Beecher  notes  that  a 
master  list  of  twenty-five  teacher  traits  was  developed 
which  seemed  to  indicate  nothing  unique  about  teachers 
(2,  p.  13). 

Gray,  when  surveying  the  present  practices  in  Canada, 
Great  Britain  and  the  United  States,  found  four  qualities  of 
teachers  most  important.  They  were  attitude  toward  work, 
personality,  spirit  in  the  classroom,  and  stimulation  of 
initiative  (5*  p.  171). 

Teacher  intelligence  has  been  a  well  studied  trait 
and  the  lack  of  conclusive  evidence,  often  a  negative  cor¬ 
relation,  has  been  disquieting  to  teacher  educators. 
Beecher’s  summary  describes  a  comprehensive  study  under  the 
auspices  of  the  University  of  Wisconsin  and  The  State 
Department  of  Public  Instruction  in  Wisconsin  which  was 
carried  out  from  193^-  to  19^-.  One  phase  of  the  study  indi¬ 
cated  that  intelligence  of  the  teacher  seemed  to  be  the 
highest  conditioning  factor  in  teacher  success,  but  the 
conclusion  was  qualified  by  stating  that  the  results  should 
be  verified  (2,  p.  27).  This  was  done  by  Rolfe  using  the 
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same  procedure  and  it  was  found  that  intelligence  showed  a 
small  negative  correlation  with  teacher  success  (2,  p.  28). 

Prom  her  research  Morris  reported  that. 

Examination  of  both  temperamental  and  intel¬ 
lectual  traits  show,  further,  that  there  is 
probably  no  unique  blend  of  traits  so  character¬ 
istic  of  the  successful  teacher,  as  markedly 
different  from  the  successful  librarian  or  other 
professional  representative,  that  this  blend  or 
constellation  of  traits  may  in  itself  be  used  as 
the  basis  for  selecting  teachers.  But  there  seem 
to  be  differences  in  the  subtle  ’balance  of  power’ 
that  reflect  the  varying  degrees  of  certain  traits 
as  they  are  combined  in  different  individuals. 

(10,  p.2) 

If  one  were  able  to  compile  a  list  of  desirable 
traits  of  teachers,  another  problem  would  arise  in  trying 
to  determine  the  relative  weights  of  each  trait  which  would 
make  up  the  cumulative  measure  of  teacher  effectiveness. 

Gage  and  Orleans  believe  that  teacher  effectiveness 
is  not  to  be  considered  as  consisting  of  any  single  fixed 
pattern  of  traits  (4,  pp.  294-298).  In  any  given  pattern 
that  produces  desired  effects  on  pupil  behavior,  the  ele¬ 
ments  may  vary  and  may  be  compensatory.  Thus,  with  modi¬ 
fications  a  different  pattern  may  still  produce  effective 
results.  Common  elements  may  not  be  present  in  effective 
teachers.  This  led  Orleans,  et  al.  to  conclude  that  look¬ 
ing  for  common  elements  in  competent  teachers  was  unreal¬ 
istic.  They  suggested  that  the  problem  was  one  of  being 
able  to  predict  what  kind  of  teachers  would  be  effective 


IX 

-  ■  ■  .  . .  :  ft  .  :  v ,  i  .  .■  ocri 

•  v  •  -  /  5  9  0  >  .  •  . .  ■  {  9  v  1.  9  i 

.  w  J  -o  ■  ■  ■  .:  -j  .  •  ... 

-•  ■'  v.  •-  C  v  '  .  .  [£. 

•  -•  o  J  ••  .  ,\.  ,  v  ;  '  ..  .  .  c 

■  jl  •  -  ..  .  . .i  ;I 

.  -•  CJ  ,  . 

i  v  ‘  .  •  isQLi  ,  ■  j  .... 

•  I  •  v  ,  :  ,  .  .  *j. 

- .  '_■!  V.'  ...  J  .  ,  :  ^  ,i«  i)l  yj  J  j  .....  J  K,  ;  I  1  ‘ 

.  -  -'v  •:  . , •  '  o.  s. 

•  • '  •  -  r  .  -  .  j.--:  ■.  ■.:  ‘  jj  )•.:  ■:  - 

[ 

•  •  . fi  .  i  moo 

(S.'-i  tOi) 

■  -  -■  v>  -  .  -j  .  . .  ■.  ... 

.  ■  ■  ■  , 

:  '  •  ■ . .  ■  ■  .  .  • .  , 

•  ■  ■  •  ■-  ’•  '  .  .  .  • 

*w  *  v- ,-'1'  -•  :  •  •  w  .  •  .  .  ;  ■  j.  \  ) 

■  •  ■  3  ■  -  ■  •  •  j  rt  j  ....  o  .....  Co  c  n 

■  • .  -  .  -  ; 

•  -  -  •  -  •  ■  59&  BOOjj 

'  '  .....  01  .  '  .  ,  .  .  ... 
' 

. •  '  ,  .......... 

"  ’  '  .  .  :  ,  ...  •  *r< 

’  r5:  •'  -  *  V  JjDi.J-  hoJ  ;o 

•  j.Joj'./u  ■)  o  ;  DS...  ;  3--,  o..  ..j-  lo  b.  I...  tali*  . 


. 


.  .  o  i , 


•:  -,v  3  :.j 


12 


with  what  kinds  of  pupils  for  promoting  change  toward  what 
kinds  of  goals  and  in  what  kinds  of  situations  (11,  p.  646). 
A  similar  belief  was  expressed  by  Ryans  in  reporting  that, 

.  .  .  The  group  also  agreed  that  to  define  the 
effective  teacher  as  a  unitary  concept  was  impossi¬ 
ble  and  that  we  had  better  also  forget  the  concept 
of  an  "ideal"  teacher.  Positively,  we  should  recog¬ 
nize  that  there  may  be  effective  teachers  who  may 
differ  markedly  among  themselves.  Perhaps  the 
problem  should  be  studied  narrowly:  "What  is  effect¬ 
ive  teaching  in  French  or  beginning  arithmetic?" 

.  .  .  (l6,  p.  240) 

Thayer  expressed  doubt  about  studying  a  teacher  by 
breaking  his  general  effectiveness  into  parts  when  he  says, 
".  .  .  and  the  whole  of  a  teacher’s  effectiveness  may  be 
greater  or  less  than  the  sum  of  the  values  assigned  to  his 
analyzed  parts."  (21,  p.  370) 


Teacher  Effectiveness  Studied  through  the  Performance  of 
Teachers 

The  assumption  with  this  approach  has  been  that 
knowledge  of  specific  performances  of  teachers  which  are 
considered  effective  will  allow  accurate  prediction  with 
regard  to  pupil  growth.  Superiors  have  rated  certain  per¬ 
formances  of  teachers  in  the  classroom  as  being  effective 
or  ineffective.  Pupils  have  indicated  what  performances 
they  "like"  and  "dislike." 
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One  fault  of  this  approach  has  been  the  combining  of 
the  description  of  the  performance  with  the  judgment  of  that 
performance  as  being  effective  or  ineffective.  Research 
that  telescopes  these  two  steps  never  gives  a  complete  pic¬ 
ture  of  what  the  teacher  does.  Rather  than  describing  per¬ 
formance  and  then  making  a  value  judgment  about  the  teacher’s 
effectiveness,  persons  have  tended  to  describe  only  those 
actions  which  they  like  or  dislike. 

Teacher  Effectiveness  Studied  through  the  Behavior  of 
Teachers 

Specific  reference  to  "behavior"  was  found  in  much  of 
the  more  recent  literature  on  teacher  effectiveness.  Behavior 
as  a  psychological  term  refers  to  the  individual’s  overt  or 
covert  reaction  to  stimuli  and  there  is  no  connotation  of 
being  "good"  or  "bad." 

Mitzel  contends  that  the  crucial  area  for  the  study  of 
teacher  effectiveness,  ".  .  .is  not  the  selection  and  mea¬ 
surement  of  appropriate  criteria,  but  the  identification, 
isolation  and  measurement  of  the  classroom  behaviors  of 
pupils  and  teachers."  (9,  p.  l)  Much  study  has  been  con¬ 
cerned  with  the  relationship  of  individual  teacher  factors, 
such  as  character  traits,  education  and  experience,  to  that 
of  pupil  growth  while  ignoring  the  teacher's  behavior. 
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Thus*  inconsistencies  have  arisen.  Levin  gave  an  example 
from  a  study  of  teacher  effectiveness  by  Davis  where  teachers' 
behavior  was  overlooked  and  the  results  were  perplexing.  In 
this  study  the  achievement  of  two  groups  of  secondary  school 
pupils  was  compared.  The  pupils  in  one  group  had  teachers 
with  neither  a  major  nor  minor  in  the  subjects  they  were 
teaching.  In  the  second  group  the  pupils  had  teachers  with 
a  major  or  minor  in  the  subjects  they  were  teaching.  The 
former  group  of  pupils  scored  higher  on  achievement  tests, 
and  one  of  the  reasons  could  be  that  the  study  by-passed  the 
actual  behavior  of  the  teachers  in  the  classroom  {j,  p.  103). 

Ackerman  says: 

Learning  is  a  change  in  behavior.  A  teacher  is 
effective  when  he  does  things  or  behaves  in  ways 
that  engender  the  learning  of  skills,  understand¬ 
ings,  work  habits,  desirable  attitudes  and  adequate 
personal  adjustment  on  the  part  of  the  pupils  or 
students.  Looked  at  from  this  point  of  view  neither 
pupil  change  nor  teacher  effectiveness  is  a  unitary 
concept.  Each  admits  of  a  number  of  possibilities. 

...  (x,  p.  284) 


A  trait,  characteristic,  or  quality  is  a  mean¬ 
ingless  concept  unless  it  can  beanchorable  to  some 
sort  of  denotable  behavior.  .  .  .  (l,  p.  286) 


The  observation  of  classroom  behavior  becomes, 
therefore,  a  crucial  step  in  the  entire  process. 

It  is,  perhaps,  the  only  valid  appraisal  of  teacher 
factors  that  can  be  made  .  .  .  behaviors  are  the 
crucial  point  in  the  conditioning  of  pupil  change. 

(1,  p.  286) 
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Gage  and  Orleans  (4),  Orleans  et  al.  (ll),  Melby  (8) 
and  Hughes  (6)  all  drew  attention  to  the  need  for  the  study 
of  teacher  behavior  patterns  in  research  in  the  field  of 
teacher  effectiveness. 

Beecher  ( 2 ,  pp.  64-78)  developed  a  scale  for  rating 
effective  teacher  behavior.  The  components  of  the  scale  were 
characteristics  based  on  what  pupils  say  they  like  in 
teachers.  The  categories  used  were  indications  of  fairness, 
cheerfulness,  sympathetic  understanding,  control,  ability  to 
get  pupil  response,  knowledge  and  skill.  The  criterion 
judges  were  considered  experts  by  their  colleagues.  The 
scores  on  the  behavior  scale  showed  a  correlation  of  .88  with 
ratings  of  the  same  teachers  by  the  criterion  judges.  Beecher 
believed  the  correlation  indicates  a  high  degree  of  validity 
of  the  instrument.  Reliability  also  appeared  high  when 
tested  by  repeated  ratings  and  also  by  other  raters.  The 
author  concluded  by  stating,  "...  the  findings  justify  the 
conclusion  that  valuable  appraisal  evidence  can  be  obtained 
on  the  basis  of  classroom  behavior  to  which  research  indi¬ 
cates  pupils  react  most  frequently."  (2,  p.77)  There  was 
some  indication  that  the  scale  recorded  teacher  character¬ 
istics  rather  than  overt  behavior.  The  choice  of  the  areas 
according  to  pupil  "likes"  and  "dislikes"  restricts  effect¬ 
ive  behavior  to  impressions  of  pupils.  There  was  no 
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indication  that  the  six  behavior  areas  or  dimensions  were 
independent.  Some  of  the  dimensions  may  be  parts  of  other 
dimensions  in  the  set  or  may  be  parts  of  dimensions  not 
included  in  the  set  of  six. 

The  evidence  above  supported  the  contention  that  study 
of  teacher  behavior  might  be  a  more  worthwhile  approach  to 
the  study  of  teacher  effectiveness. 

II.  EVALUATING  TEACHER  EFFECTIVENESS 

Since  an  integral  part  of  this  study  involved  the 
rating  of  a  teacher* s  effectiveness,  some  of  the  literature 
on  this  subject  is  reported. 

Ratings  provide  the  most  common  type  of  quantitative 
appraisal  of  teacher  effectiveness.  They  may  be  direct  com¬ 
parisons  which  involve  ranking  or  pairing  or  they  may  involve 
scales.  The  rating  may  be  based  on  teacher  traits,  teacher 
performances,  pupil  results,  or  a  combination  of  these. 

In  1921,  Rugg  (15^  p.  425)  described  some  very 
extensive  research  concerning  the  rating  of  human  character. 
He  believed  that  rating  on  a  point  scale  was  possible  if  the 
final  rating  was  an  average  of  at  least  three  independent 
ratings,  if  the  scales  were  comparable  and  equivalent,  and 
if  the  three  raters  were  competent  by  being  thoroughly 
acquainted  with  the  individual  being  rated.  This,  he  believed. 
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was  unattainable  in  public  schools.  In  work  done  with  army 
officers  during  World  War  1,  he  reports,  ”,  .  .  it  was  very 
improbable  that  an  officer  was  located  within  even  his  proper 
•fifth’  of  the  entire  scale  by  an  ’official’  rating.  The 
ratings  were  practically  valueless.”  (15,  p.  436)  Later  Rugg 
worked  on  a  more  elaborate  experiment  using  a  "man-to-man" 
comparison.  Here  the  rating  was  made  by  comparing  the  person 
to  be  rated  against  a  standard  determined  by  a  select  prior 
ranking.  nThe  chances  can  not  be  more  than  4  to  1  that  any 


rating  will  be  within  14  points 


true  rating."  (15,  p.  488) 

The  persons  most  thoroughly  acquainted  with  the 
teacher's  work  are  the  students.  Beecher  reported  Bryan's 
study  of  "Pupil  Rating  of  Secondary  School  Teachers"  which 
was  an  early  attempt  to  include  the  pupils  as  the  raters. 

He  reported  the  correlation  between  students'  and  adminis¬ 
trators’  ratings  of  the  same  teacher  was  .68  (2,  p.  18) . 
Beecher  summarized  some  of  the  studies,  which  used  the  pupil 
as  rater,  by  noting,  ".  .  .  it  appears  that  both  junior  and 
senior  high  school  pupils  can  point  out  specific  strong  and 
weak  spots  in  teaching  to  a  degree  that  makes  pupil  ratings 
worthwhile."  (2,  p.  18) 


The  relationship  between  general  ratings  of  a 
teacher’s  effectiveness  and  pupil  results  was  studied  by 
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McCall.  Shane  and  Yauch  reported  that  McCall’s  study  showed 
no  statistically  significant  evidence  of  relationships  between 
pupil  progress  and  ratings  by  principals*  ratings  by  peers* 
and  self-ratings  (l8*  p.  157). 

Criticisms  of  ratings  of  teacher  effectiveness  have 
been  frequent.  One  area  of  such  criticism  has  been  the 
standard  against  which  the  teacher  would  be  judged.  The  most 
common  standard  has  been  a  fuzzy  picture  of  the  "ideal" 
teacher.  Biber  believes*  however*  that  it  is  futile  to  mea¬ 
sure  a  teacher  against  a  fixed  point  of  the  "ideal"  teacher. 
She  considers  the  effective  teacher  in  terms  of  a  possible 
range  within  which  satisfactory  teaching  practices  may  fall. 
Outside  both  extremities  of  this  range  are  practices  which 
are  considered  unsatisfactory  (3*  p.  216) .  As  a  standard  the 
"ideal"  teacher  is  limited  due  to  the  great  amount  of  vari¬ 
ability  in  the  definitions  of  "ideal." 

Thayer  criticized  ratings  because  they  are  too  super¬ 
ficial.  He  states  that  a  rating* 

.  .  .  serves  to  give  one  the  first  over-view  of 
the  teaching  performance*  a  general  and  superficial 
job  analysis,  rather  than  a  vital  and  profound 
analysis  of  the  teaching  act.  .  .  .  (2*  p.  376) 

Rugg  believed  that  we  rate  individuals  in  terms  of 
general  mental  attitude  toward  them.  This  is  called  the 
"halo"  effect.  This  mental  attitude  toward  the  individual 
dominates  judgment  of  particular  qualities  (15*  p.  37). 
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This  belief  points  out  the  disadvantages  of  itemized  rating 
scales.  Rugg  gave  an  illustration  of  this  theory  when  he 
reported  on  a  captain  who  was  rated  the  lowest  of  twenty  by 
thirteen  officers  on  all  categories  of  the  scale  such  as 
physical*  intellectual*  and  leadership  qualities.  Yet*  on 
an  objective  intelligence  test*  the  captain  scored  the  highest 
of  the  twenty  people  rated  and  he  had  been  a  Rhode’s  scholar. 
The  opinions  of  the  rating  officers  of  the  captain’s  general 
personal  quality  influenced  their  opinion  on  the  specific 
categories  (15*  pp.  37*  38).  A  similar  criticism  has  been 
voiced  about  students'  ratings  of  teachers.  Their  sometimes 
shallow  ’’likes"  and  "dislikes"  influence  their  rating  of  the 
teacher  on  specific  items. 

Another  area  of  criticism  was  the  lack  of  objectivity 
of  the  ratings.  A  common  belief  is  that  teaching  is  an  art 
and  many  of  the  results  are  intangible.  Therefore*  to  rate 
teaching  objectively  is  difficult*  and  perhaps  impossible. 
Young  notes  that*  "Any  rating  scale  is  largely  subjective 
even  when  developed  to  the  greatest  extent  and  operated  with 
extreme  care."  (22*  p.  j6) 

Shane  and  Yauch  state: 

...  It  became  apparent  in  a  few  years  that 
"improved"  rating  scales  were  simply  a  refinement 
of  an  original  error.  If  the  basis  for  judgment 
was  to  be  human  subjectivity*  it  made  little 
difference  whether  or  not  the  instrument  for  the 
collection  of  evidence  was  objectively  constructed: 
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the  value- judgments  remained  subjective.  After 
the  evidence  was  in,  it  still  had  to  be  evaluated 
by  fallible  humans.  (l8,  p.  l4l) 

Some  of  the  disadvantages  of  subjective  ratings  are 
that  they  tend  to  be  unreliable,  they  do  not  allow  the  rater 
to  give  a  detailed  explanation  for  the  score  he  gives  the 
teacher,  and  they  give  no  basis  for  the  teacher  to  analyze 
his  weaknesses  and  improve.  If  ratings  are  a  supervisory 
device,  the  latter  two  criticisms  are  probably  valid,  but 
when  ratings  are  used  for  research,  these  criticisms  are  not 
relevant  as  there  is  no  attempt  made  to  help  the  individual 
teacher.  The  tendency  to  be  unreliable  and  the  difficulty 
in  determining  reliability,  however,  are  important  criti¬ 
cisms  of  subjective  ratings. 

Thayer  supported  the  viewpoint  that  a  global 
impression  or  a  more  subjective  evaluation  of  teacher 
effectiveness  was  superior  when  he  stated: 

...  We  now  make  bold  to  suggest  general 
impression  as  the  most  effective  measure  of  a 
teacher’s  worth.  (21,  p.  377) 


.  .  .  This  general  impression  is  a  general¬ 
ization  based  upon  intimate  contacts  with  a 
teacher's  performances.  .  .  .  When  a  supervisor 
works  intensively  with  his  teachers  in  the 
solution  of  their  problems  and  in  the  under¬ 
taking  of  new  ventures,  he  will  know  their 
strength  and  their  weaknesses  more  accurately 
than  is  possible  from  a  few  observations  with 
a  score  card.  (21,  p.  378) 
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Rose  summarized  the  difficulties  encountered  with 
ratings  as  being:  problem  of  untrained  raters *  halo  effect* 
tendency  to  judge  high  or  low  and  the  reluctance  to  use 
extremes*  problem  of  lack  of  reliability  between  raters* 
problem  of  weighting*  effect  on  rater  of  the  order  of  the 
teachers  to  be  rated*  the  usual  dependence  on  memory  when 
giving  a  rating*  and  the  fact  that  the  rater  is  performing 
two  operations  at  one  time-recording  and  judging  (l4* 
p.  233). 

III.  PERCEPTION  OP  BEHAVIOR  AND  JUDGMENT  OF  EFFECTIVENESS 


Since  perception  and  judgment  were  essential  processes 
in  operation  in  this  research*  it  was  considered  necessary 
to  describe  some  of  the  literature  pertinent  to  these 
processes . 

Both  the  description  of  behavior  and  the  rating  of 
effectiveness  imply  that  the  judges  perceive  the  teacher 
within  a  certain  frame  of  reference  that  they  have  developed. 
Sherif  and  Cantril  state  this  very  clearly  in  the  following 
quotation: 

Irrespective  of  the  particular  way  in  which  an 
individual  acquires  an  attitude  in  social  life* 
the  literature  of  social  psychology  is  rife  with 
data  which  support  the  formulations  reached  from 
our  survey  of  general  psychology:  that  perception 
and  judgment  are  selective  and  occur  within  a 
referential  framework*  that  frames  or  points  of 
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reference  are  inevitably  established  if  an 
individual  repeatedly  faces  the  same  stimulus 
situation,  that  these  frames  and  points  of 
reference  are  by  no  means  always  confined  to 
consciously  accepted  instruction  or  imposed 
norms  but  can  become  established  without  the 
individual’s  realization  of  it,  and  that  once 
established  these  frames  and  points  of  reference 
serve  as  anchorages  for  perception  and  judgment. 

(19,  p.  6o) 

From  a  review  of  many  studies  Scott  notes  the  follow¬ 
ing  four  generalizations  about  perceptions: 

1.  That  which  an  individual  perceives  when 
confronted  with  a  stimulus  situation  may  not 
correspond  to  "objective  reality."  .  .  . 

2.  Several  individuals  confronted  with  the 
same  stimulus  situation  may  perceive  (i.  e., 
structure  and  interpret)  the  same  situation 
quite  differently.  .  .  . 

3.  The  manner  in  which  an  individual 
perceives  a  stimulus  situation  is  affected 
biological,  psychological,  social,  and 
cultural  factors.  .  .  . 

4.  The  individual  responds  to  his  environ¬ 
ment  in  terms  of  his  perceptions  of  that 
environment.  .  .  .  (17*  P.  8,  9) 

With  regard  to  the  ability  to  judge  others,  Taft 
provided  evidence  that  this  may  be  both  a  general  ability, 
and  a  specific  ability.  The  latter  ability  would  vary  with 
the  type  of  judgment  to  be  made  and  with  the  situation. 

The  main  attributes  of  judging  others  seemed  to  lie  in  three 
areas:  possessing  appropriate  judgmental  norms,  judging 

ability,  and  motivation.  The  development  of  appropriate 
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norms  is  dependent  upon  the  similarity  of  background  of  the 
judge  and  judged,  and  also  on  the  proximity  of  the  judge  to 
the  judged  in  regard  to  their  working  relations  (20,  p.  20). 
Taft  believed  that  his  investigation  showed  that  motivation 
was  probably  the  most  important  area  affecting  judgments. 

IV.  SUMMARY 

The  literature  reviewed  described  some  of  the 
approaches  that  have  been  used  to  study  teacher  effective¬ 
ness.  The  area  of  teacher  behavior  seemed  to  have  been 
neglected  and  appeared  to  be  the  most  fruitful  area  for 
research  on  this  topic. 

Many  studies  have  been  reported  on  rating  of  teacher 
effectiveness.  A  summary  of  some  of  the  criticisms  of  the 
use  of  ratings  indicated  that  global  impressions  by  judges 
in  close  contact  with  the  teacher  might  be  useful,  and  that 
there  are  many  difficulties  in  using  ratings  of  teacher 
effectiveness. 

Perception  of  behavior  is  determined  by  the  frame  of 
reference  that  the  judge  has  developed  and  judgments  of 
others  appeared  to  be  related  to  judging  ability,  develop¬ 
ment  of  appropriate  norms,  and  motivation  of  the  judge. 
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CHAPTER  III 


THEORY  AND  METHODOLOGY 

I.  FRAMEWORK  TO  STUDY  TEACHER  EFFECTIVENESS 

A  research  pattern  that  includes  the  important  variables 
which  are  related  to  teacher  effectiveness  will  serve  as  a 
framework  within  which  theories  can  be  hypothesized  and  tested. 
Levin  believes  there  is  lack  of  a  conceptual  framework  for 
research  on  this  subject.  He  states  that  a  framework  provides 
an  operational  philosophy  (8,  p.  101).  Kinney  agrees  with 
this  when  he  says  that  we  need  to,  "...  Provide  a  conceptual 
framework  to  aid  in  setting  up  hypotheses  about  teacher  behavior 
and  related  variables.11  (7 >  p.  3) 

For  the  purpose  of  this  study,  the  framework  suggested  by 
Mitzel  (9,  p.  l)  was  used.  He  postulated  that  there  are  four 
types  of  variables  that  are  related  to  teacher  effectiveness. 

Type  I  variables  are  characteristics  of  teachers.  These 
are  variables  concerned  with  the  individual's  personality,  his 
background  and  his  education.  They  differ  for  almost  every 
teacher,  but  can  be  hypothesized  as  having  some  relation  to 
teacher  effectiveness.  They  are  called  prediction  sources. 

Type  II  variables  are  factors  from  the  environment  of 
the  school  situation.  They  influence  the  complex  behavior 
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patterns  that  are  part  of  the  educational  process.  These 
situational  variables  would  include  environmental  factors 
such  as  the  community,  and  factors  within  the  school  such 
as  the  school  plant  and  school  organization.  These  vari¬ 
ables  would  also  include  pupil  factors  such  as  attitudes, 
interests  and  abilities.  The  pupil  factors  are  situational 
factors  that  affect  teacher  and  pupil  behavior  just  as 
teacher  variables.  Type  I,  affect  teacher  and  pupil  behavior. 

Type  III  variables  deal  with  behaviors.  These 
behaviors  of  the  teachers  and  pupils  are  expressions  of  the 
individuals  in  action  within  a  certain  situation.  Teacher 
behavior  is  found  in  the  classroom,  in  the  school,  in  the 
community  and  in  the  profession.  The  focal  point  is  the 
interaction  of  teacher-pupil  behavior  in  the  classroom. 

Mitzel  states,  "Considering  that  classroom  behaviors  bear 
such  heavy  responsibilities  in  determining  educational  out¬ 
comes,  remarkedly  little  is  known  about  them  or  their 
effects."  ( 9}  p.l) 

Type  IV  variables  are  the  standards  with  which  the 
results  of  education  are  judged.  The  term  intermediate 
educational  goals  is  used  to  distinguish  them  from  ultimate 
goals  found  in  the  community.  Such  factors  as  pupil  growth 
in  subject  matter  knowledge,  in  appreciations,  and  in 
social  skills  are  some  of  these  variables. 
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Figure  I  gives  a  diagram  of  this  pattern.  Solid  lines 
indicate  direct  effects,  and  broken  lines  indicate  indirect 
effects,  although  the  distinction  between  direct  and  indi¬ 
rect  might  be  questionable. 

In  this  pattern,  teacher  variables  and  pupil  variables 
are  direct  determinants  of  teacher  and  pupil  behavior. 
Environmental  variables  indirectly  influence  both.  The 
crucial  area  is  the  interaction  of  teacher  and  pupil 
behavior. 

Levin  states  that  to  have  a  complete  picture  of 
teacher  effectiveness  one  would  predict  from  the  antecedent 
conditions  to  the  classroom  behavior  and  in  turn  from 
behavior  to  classroom  effects  (8,  p.  102). 

II.  TEACHER  BEHAVIOR 

From  Chapter  II  and  the  above  section  the  crucial  area 
in  the  study  of  teacher  effectiveness  appeared  to  be  teacher 
behavior.  The  initial  step  in  this  study  was  to  determine 
dimensions  of  teacher  behavior  which  would  be  to  some 
extent  independent  of  each  other,  and  would  show  promise 
of  being  related  to  teacher  effectiveness. 

Halpin  (4,  p.  4)  suggested  that  the  two  dimensions  of 
Consideration  and  Initiating  Structure,  which  have  been 
found  to  be  valid  measures  of  leader  behavior,  might  be 
characteristic  of  teacher  behavior  in  relation  to  pupils. 
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Figure  1  Framework  for  Research  in  Teacher  Effectiveness 
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He  states. 


.  .  .  This  LBDQ  technique  has  not  as  yet  been 
used  in  studies  of  classroom  teachers,  but  there 
is  strong  presumptive  evidence  to  suggest  that 
these  same  two  dimensions  are  of  special  signifi¬ 
cance  in  the  interaction  between  teachers  and 
students.  (4,  p.  175) 

Leader  behavior  is  described  on  the  Ohio  Leader  Behavior 
Description  Questionnaire  (LBDQ)  and  research  has  shown  that 
the  dimensions  are  applicable  to  aircraft  commanders,  school 
superintendents,  university  administrators,  factory  foremen 
and  school  principals.  The  definitions  of  the  two  dimensions 
described  on  the  LBDQ  are  as  follows: 

Initiating  Structure-in-Interaction  refers  to 
the  leader’s  behavior  in  delineating  the  relation¬ 
ship  between  himself  and  the  members  of  his  group, 
and  in  endeavoring  to  establish  well-defined  patterns 
of  organization,  channels  of  communication,  and  ways 
of  getting  a  job  done.  The  leader  establishes 
’’Structure"  in  the  way  the  group  members  interact 
with  him  and  with  each  other  so  that  whenever  the 
group  is  confronted  with  a  novel  problem,  the 
members  can  resort  to  these  Structures-in- Interaction 
to  facilitate  the  solution  of  group  problems.  Hence 
the  group  members  are  not  dependent  upon  the  leader 
for  fresh  and  specific  instructions  on  how  to  handle 
each  new  problem  that  arises.  (4,  p.  174) 

Consideration  refers  to  behavior  that  reflects 
friendship,  mutual  trust,  respect  and  warmth  in 
the  relationship  between  the  leader  and  group 
members.  This  represents  the  human  relations 
aspect  of  leader  behavior.  (4,  p.  175) 

It  can  be  assumed  that  these  dimensions  are  part  of 
teacher  behavior  since  they  are  valid  dimensions  of  any  leader 
behavior.  Teachers  in  the  classroom  are  closely  equated  to 
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leaders  of  a  productive  group,  where  goal  accomplishment  or 
getting  a  job  done,  and  human  relations  are  important  aims 
of  this  leader. 


III.  JUDGES  OF  TEACHER  LEADER  BEHAVIOR 
AND  TEACHER  EFFECTIVENESS 


The  LBDQ  provided  an  instrument  through  which  general 
leader  behavior  could  be  described. 

Although  the  dimensions  of  Consideration  and  Initiat¬ 
ing  Structure  might  be  characteristic  of  teacher  behavior, 
it  was  essential  that  some  indication  of  the  extent  of  their 
relationship  to  effective  teaching  be  obtained.  This 
required  criteria  with  which  to  judge  teacher  effectiveness 
and  the  most  common  criteria  are  the  judgments  of  superiors. 
The  Encyclopedia  of  Educational  Research  states: 

.  .  .  The  criterion  [of  teaching  succes£]  most 
frequently  used  in  studies  concerned  with  pre¬ 
service  selection  of  teachers  is  the  judgment  of 
persons  in  the  field,  such  as  superintendents, 
principals,  departmental  supervisors,  and  teaching 
colleagues.  .  .  . 


Authorities  are  in  quite  general  agreement  that 
the  judgment  of  experts  is  the  best  available 
criterion  of  teaching  success.  .  .  .  (10,  p.  1391) 

The  selection  of  judges  to  rate  teaching  effectiveness 

was  based  on  several  factors.  Firstly,  they  should  have 
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been  in  close  contact  with  the  teacher  over  a  period  of 
time.  Secondly,  they  should  include  viewpoints  that  would 
give  as  complete  a  picture  of  the  teacher  as  possible.  The 
numerous  judges  would  include  the  opinions  of  those  who  had 
direct  contact  with  the  teacher  in  the  process  of  teaching. 
Difficulties  in  judging  effectiveness  have  been  noted  in 
Chapter  II,  but  some  of  the  criticism  was  regarding  the  use 
of  such  evaluative  judgments  for  purposes  other  than 
research.  The  criterion  judge  is  in  no  way  affecting  the 
future  of  a  teacher  or  affecting  his  personal  relations 
with  the  teacher  when  he  makes  a  judgment  that  is  to  be 
used  solely  for  research,  and  is  therefore  more  likely  to 
indicate  his  true  opinion. 

To  describe  the  teacher's  behavior  students  would  need 
to  be  included.  On  the  LBDQ  the  main  description  of  leader 
behavior  was  provided  by  subordinates,  and  students  are  the 
only  ones  filling  this  role  in  relation  to  teachers.  Levin 
states, 

.  .  .  prom  work  done  at  Harvard]  the  evidence 
is  clear  ohat  there  is  agreement  among  pupils  in 
a  classroom  as  to  how  the  teacher  acts.  Method¬ 
ologically,  this  gives  an  important  tool  for  the 
measurement  of  teacher  behavior.  In  addition, 
there  is  some  evidence  that  principals  and  pupils 
tend  to  confirm  each  others  judgments  about  the 
teacher.  (8,  p.  104) 

Ryans  says  that  the  teacher's  demonstration  of  effect¬ 
iveness  (his  effective  behavior)  may  be  measured  by  means 
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of  objective  tests  and  the  judgments  of  various  kinds  of 
persons  such  as  the  teacher  himself,  fellow-teachers,  pupils, 
superintendents,  principals,  supervisors  or  lay  persons 
(ii,  p.  258). 

To  describe  and  rate  the  teacher  it  was  believed  that 
students,  other  teachers  on  the  same  staff,  principals, 
superintendents  and  the  teachers  themselves  would  include 
all  those  persons  who  have  direct  contact  with  the  teacher. 
The  superintendent  would  have  had  the  most  experience  in 
evaluating  teachers  and  the  students  would  have  the  least 
specific  ability  to  judge  the  teacher's  effectiveness  as 
Taft  (12,  p.  20)  has  shown  that  age  in  children  is  directly 
related  to  the  ability  to  judge  others. 

Students  would  have  had  the  most  contact  with  the 
teacher  and  thus  would  be  more  familiar  with  the  teacher's 
behavior  while  superintendents  would  be  less  familiar  with 
the  teacher's  everyday  behavior. 

IV.  HYPOTHESES 

The  following  were  the  three  major  hypotheses  which 
resulted  from  the  theoretical  design  discussed  in  this 
chapter. 

1.  Students  and  superiors  (principals  and  superin¬ 
tendents)  rate  those  teachers  highest  in  teaching 
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effectiveness  who  score  above  the  median  on  both  of  the 
leader  behavior  dimensions.  Consideration  and  Initiating 
Structure.  Students  and  superiors  rate  those  teachers  lowest 
in  teaching  effectiveness  who  score  below  the  median  on  both 
of  the  leader  behavior  dimensions. 

2.  Students  tend  to  relate  effectiveness  of  teaching 
with  teachers’  scores  on  both  Consideration  and  Initiating 
Structure,  but  Consideration  is  more  closely  related. 

3.  Superiors  tend  to  relate  effectiveness  of  teaching 
with  teachers’  scores  on  both  Consideration  and  Initiating 
Structure,  but  Initiating  Structure  is  more  closely  related. 

V.  PROCEDURE  FOR  TESTING  HYPOTHESES 

In  general  the  procedure  to  test  the  hypotheses  was  to 
describe  leader  behavior  and  rate  effectiveness  of  teachers 
through  the  use  of  two  instruments.  These  two  instruments 
were  to  be  administered  to  students,  other  teachers  on  the 
same  staff,  principals,  superintendents,  and  the  teachers 
themselves  in  schools  in  Alberta.  This  section  outlines  the 
instruments  used  and  the  design  of  the  research  to  test  the 


hypotheses . 
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Instruments 

Both  of  the  instruments  used  were  adaptations  of  other 
instruments.  The  instruments  were  identical  for  all  sets  of 
raters,  but  five  colours  were  used  to  facilitate  analysis. 
Both  instruments  were  printed  on  the  same  four  page  booklet, 
with  the  behavior  description  being  completed  first,  fol¬ 
lowed  by  the  rating  scale  which  was  on  the  back  page. 

Global  rating  scale  of  teacher  effectiveness .  This 
very  short  instrument  (Appendix  A,  p.  124)  was  adapted  from 
Andrews  (l,  p.  167) . 

The  instrument  gives  a  global  rating.  This  means  that 
a  single  rating  gives  an  overall  picture  or  general  impres¬ 
sion  of  the  individual’s  opinion  of  the  effectiveness  of  the 
teacher. 

The  rating  was  applied  only  to  the  teaching  of  students 
and  the  directions  mentioned  not  to  include  other  aspects  of 
the  teacher’s  work.  This  complies  with  the  limitation  of 
effectiveness  stated  in  Chapter  I.  The  directions  state, 
’’Please  check  the  statement  which  in  your  judgment  best 
indicates  how  well  this  teacher  does  in  teaching  his 
students.”  (Appendix  A,  p.  124) 

The  rater  was  to  use  as  his  standard  the  average 
teacher  of  the  particular  school  in  which  he  was  at  the 
time  of  this  administration.  This  was  accomplished  by  adding 
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the  words  nfor  this  school”  to  each  of  the  six  categories. 
This  was  an  attempt  to  reduce  the  variability  among  raters 
and  to  avoid  the  concept  of  the  "ideal"  teacher. 

The  ratings  ranged  from  a  possible  "l.  Outstanding 
for  this  school”  to  ”6.  Very  low  for  this  school."  Wandt 
and  Ostreicher  (13>  p.  6)  quote  Symonds  as  believing  that 
seven  is  the  optimal  number  of  scale  units  for  ratings. 

This  scale  has  six  since  the  opportunity  to  choose  a  middle 
or  average  rating  has  been  eliminated. 

This  instrument  was  used  as  a  validity  criterion  for 
the  other  instrument,  the  behavior  description  question¬ 
naire,  and  as  such  its  own  validity  is  not  measured. 

The  reliability  of  the  rating  scale  presented  a 
problem.  With  this  instrument  reliability  could  be  measured 
only  by  the  test-retest  method.  This  would  require  that 
the  names  of  the  raters  be  recorded  on  the  instruments  so 
that  the  retest  scores  could  be  compared.  To  do  this  would 
destroy  the  anonymity  of  the  raters  and  the  investigator 
believed  this  would  change  the  motivation,  and  attitude  of 
the  raters,  to  such  a  degree  that  it  would  make  the  scale 
much  less  useful.  The  only  slight  indication  of  reliability 
would  be  the  consistency  from  rater  to  rater  for  the  same 
teacher  and  this  is  discussed  in  Chapter  V,  but  this  gave 
no  indication  of  the  reliability  i or  the  same  raters  and 
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Is  considered  a  definite  weakness  in  the  research. 


The  Ohio  Leader  Behavior  Description  Questionnaire 
(LBDQ) .  The  instrument  used  in  this  study  to  describe 
behavior  was  an  adaptation  of  the  LBDQ1. 

The  LBDQ  provided  a  method  whereby  the  designated 
leaders  of  a  formal  organization  might  be  described  by  the 
group  members.  It  specifically  states  in  the  directions 
that  it  is  a  description  of  the  leader's  behavior  and  does 
not  ask  the  respondent  to  judge  whether  the  behavior  is 
desirable  or  undesirable. 

Halpin  states: 

The  LBDQ  was  developed  by  the  staff  of  the 
Personnel  Research  Board,  The  Ohio  State  Uni¬ 
versity,  as  one  project  of  the  Ohio  State 
Leadership  Studies,  directed  by  Dr.  Carroll  L. 

Shartle.  Hemphill  and  Coons  constructed  the 
original  form  of  the  questionnaire;  and  Halpin 
and  Winer,  in  reporting  the  development  of  an 
Air  Force  adaptation  of  the  instrument,  identi¬ 
fied  Initiating  Structure  and  Consideration  as 
two  fundamental  dimensions  of  leader  behavior. 

.  .  .  Initiating  Structure  and  Consideration 
accounted  for  approximately  3^  and  50  percent 
of  the  common  variance.  (3*  P-  l) 

In  a  later  study  using  the  LBDQ,  Halpin  concluded  that  both 
dimensions  were  related  to  aircraft  commanders'  effective¬ 
ness  when  rated  by  crews  and  superiors  (5,  p.  18) . 


■^Permission  to  adapt  was  obtained  from  R.  M.  Stogdill 
Ohio  State  University,  February,  1959- 
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A  considerable  amount  of  research  was  reported  regard¬ 


ing  the  independence  of  the  dimensions.  Hemphill  reported 
correlations  between  the  dimensions  of  .18  for  teachers’ 
descriptions  of  superintendents,  .61  for  board  members’ 
descriptions  of  the  same  superintendent,  .56  for  descriptions 
of  industrial  foremen,  and  .45  in  a  military  situation  (6, 
pp.  92,  93).  Halpin  reported  correlations  between  the 
dimensions  of  .38  and  .51  for  descriptions  of  aircraft 
commanders.  He  stated  that  the  probability  was  very  slight 
that  a  leader  would  be  described  as  extremely  low  or 
extremely  high  on  both  dimensions  (5*  pp.  2,  17). 

The  LBDQ  manual  provides  no  norms,  but  there  are  mean 
scores  which  were  obtained  for  aircraft  commanders  and 
school  superintendents  (3*  p.  10 ) . 

In  using  the  instrument  the  respondent  describes  the 
leader  on  forty  items  by  indicating  the  frequency  with  which 
he  engages  in  each  behavior  by  circling  a  letter  represent¬ 
ing  one  of  five  adverbs,  "always,”  "often,"  "occasionally," 
"seldom,"  and  "never."  It  is  scored  by  assigning  values  of 
4,  3,  2,  1  and  0  to  "always,"  "often,"  "occasionally," 
"seldom,"  and  "never,"  respectively.  Of  the  forty  items, 
ten  are  not  scored  so  that  each  dimension  contains  fifteen 
items . 

In  administering  the  questionnaire.  Initiating 
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Structure  and  Consideration  are  not  mentioned.  With  regard 
to  the  number  of  respondents  per  leader,  Halpin  states. 

Experience  suggests  that  a  minimum  of  four 
respondents  per  leader  is  desirable  and  that 
additional  respondents  beyond  ten  do  not  increase 
significantly  the  stability  of  the  index  scores. 

(3,  P.  7) 

The  Teacher  Leader  Behavior  Description  Questionnaire 
( TLBDQ)  (Appendix  A) .  The  administration,  the  scoring  and 
the  purpose  of  the  adapted  instrument  were  similar  to  that  of 
the  LBDQ  described  above.  The  original  questionnaire  was 
changed  to  make  it  more  suitable  to  a  classroom  situation. 

The  ten  unscored  items  were  discarded.  Thirteen  of  the 
fifteen  Initiating  Structure  items  remained  the  same  as  did 
nine  of  the  fifteen  Consideration  items.  This  is  with  the 
exception  of  the  minor  changes  for  the  use  of  "group"  to 
"class"  and  "group  members"  to  "students."  Of  the  remaining 
eight  items,  some  changes  were  made,  but  with  six  of  them  the 
investigator  tried  to  keep  the  intent  of  the  original  item. 

With  two  items  considerable  change  was  made.  These 
were  items  number  17  and  19  which  were  both  part  of  the 
Consideration  dimension.  The  items  used  were  chosen  from  a 
list  of  behaviors  reported  by  Beecher  (2,  pp.  64-78)  as  being 
considered  important  by  students,  bince  these  items  were  so 
different  from  the  originals,  it  was  considered  necessary  to 
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establish  the  amount  of  agreement  with  the  total  remaining 
Consideration  score.  Item  number  17  gave  a  correlation  of 
.48  (significant  at  the  .01  level,  N  =200)  with  the  total 
Consideration  score  and  item  number  19  correlated  .38 
(significant  at  the  .01  level,  N  =200).  These  were  con¬ 
sidered  sufficiently  high  to  include  them  as  part  of  the 
questionnaire.  These  results  were  obtained  from  the  study 
proper. 

To  establish  the  reliability  a  random  sample  of  100 
questionnaires  was  selected  from  the  989  questionnaires  that 
were  satisfactorily  completed  in  the  study.  This  sample  was 
so  chosen  that  the  ratio  of  students,  to  other  teachers,  to 
principals,  to  superintendents,  to  teachers  themselves  was 
the  same  as  in  the  total  group.  This  sample  of  100  was  used 
to  check  the  reliability.  Using  the  Spearman-Brown  formula 
the  corrected  split-half  reliabilities  were  .75  and  .86  for 
Initiating  Structure  and  Consideration,  respectively.  HaUpin 
(3,  p.  6)  reported  .83  and  .92  for  the  same  dimensions. 
Initiating  Structure  seems  to  have  a  lower  reliability.  The 
investigator  considered  the  reliability  for  the  Initiating 
Structure  dimension  on  the  TLBDQ  as  lower  than  desirable. 
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The  Pilot  Study 

To  examine  the  wording  of  the  items  for  use  with  stu¬ 
dents  a  pilot  study  was  conducted  in  a  junior  high  school 
and  a  senior  high  school  in  Edmonton.  This  involved  nine 
students,  two  teachers  and  a  principal  in  each  of  the  two 
schools.  There  was  no  attempt  made  to  establish  the  validity 
of  the  instrument  in  relation  to  effectiveness  as  it  was  felt 
that  this  was  one  of  the  purposes  of  the  study  proper. 

Only  a  few  minor  changes  were  made  in  the  wording.  It 
was  found  that  students  had  difficulty  with  the  three  nega¬ 
tive  items  which  were  in  the  original  LBDQ.  The  revision 
left  only  one  negative  item  and  here  the  values  assigned  were 
reversed  from  the  usual  method. 

The  investigator  found  that  senior  high  school  students 
expressed  no  difficulty  in  interpreting  and  understanding  the 
items.  Some  of  the  teachers  thought  that  a  few  items  were 
ambiguous . 

In  the  junior  high  school  there  was  more  difficulty 
expressed  by  the  students  and  interrogation  by  the  investi¬ 
gator  indicated  some  variation  in  the  interpretation  of  cer¬ 
tain  items.  This  was  especially  true  with  grade  seven.  For 
this  reason  the  original  intent  to  include  grade  seven  in  the 
study  was  dropped. 
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It  was  difficult  to  establish  objectively,  but  the 
investigator  felt  that  the  descriptions  given  by  junior  high 
school  students  were  less  valid  due  to  misunderstanding  of 
the  terminology  used  in  the  items. 

Plan  for  Collecting  the  Data 

To  test  the  hypotheses  the  two  instruments  were  to  be 
administered  to  the  five  sets  of  respondents.  Thirty-six 
provincially  appointed  superintendents  were  contacted  by 
letter  (Appendix  B)  asking  for  the  cooperation  of  one  school 
in  their  area.  The  schools  were  required  to  have  at  least 
ten  teachers  on  the  staff  and  at  least  seven  teaching  above 
grade  seven.  The  investigator  asked  that  no  school  be 
included  in  the  research  unless  every  member  on  the  staff 
wished  to  take  part. 

The  plan  was  to  use  non-city  schools  as  it  was  believed 
that  the  relationship  between  the  superintendent  and  teachers 
might  not  be  similar  to  the  relationship  between  the  super¬ 
intendent  and  teachers  in  divisions.  Elementary  teachers 
were  not  to  be  included  among  those  to  be  described  or  rated 
as  elementary  pupils  would  not  be  able  to  understand  the 
wording  of  the  TLBDQ. 

The  plan  was  to  obtain  the  cooperation  of  twelve 
schools.  In  each  school  five  teachers,  to  be  the  centre  of 
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the  study,  would  be  chosen  at  random  from  those  teaching 
above  grade  seven.  At  the  same  time  five  other  teachers 
were  to  be  chosen  to  describe  the  behavior  and  rate  the 
effectiveness  of  the  five  teachers  first  chosen.  For  each 
of  the  five  teachers  that  were  the  centre  of  the  study,  ten 
students  would  be  chosen  at  random  from  all  the  students 
taught  by  that  teacher.  The  teachers  themselves,  as  well  as 
their  respective  principals  and  superintendents,  would  also 
describe  and  rate  the  teachers. 

This  plan  would  involve  60  teachers  as  the  centre  of 
the  study  who  would  be  described  and  rated  by  600  students, 
by  60  other  teachers,  by  12  principals,  by  12  superintendents 
and  by  the  teachers  themselves. 

VI.  SUMMARY 

There  is  need  for  a  framework  within  which  to  study 
teacher  effectiveness.  A  pattern  was  suggested  in  which  the 
crucial  area  is  that  of  teacher  and  pupil  behaviors.  In  this 
study  the  area  of  teacher  behavior  is  investigated  and  it 
was  suggested  that  the  two  dimensions  of  teacher  behavior 
which  might  be  related  to  teacher  effectiveness  are  the 
leader  behavior  dimensions  of  Consideration  and  Initiating 
Structure.  The  types  of  judges  to  provide  the  descriptions 
of  behavior  and  the  ratings  of  effectiveness  were  discussed 
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and  the  specific  hypotheses  to  be  tested  were  stated.  The 
procedure  to  be  used  in  testing  the  stated  hypotheses  was 
outlined  by  describing  the  two  instruments  and  the  plans 
for  collecting  data. 
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CHAPTER  IV 


COLLECTION  OF  THE  DATA 

The  data  for  this  research  were  collected  from  twelve 
schools  in  the  Province  of  Alberta.  The  staffs  of  the  twelve 
schools  volunteered  for  the  project.  The  investigator  visited 
each  school  in  April  and  May  of  1959. 

I.  METHOD  OF  COLLECTING  THE  DATA 
The  Teachers  Who  Were  the  Centre  of  the  Study 

At  each  school  five  teachers  were  chosen  from  a  hat  on 
the  day  of  the  visit.  There  were  two  exceptions  to  this  pro¬ 
cedure.  In  one  school  there  were  only  four  teachers  teaching 
above  grade  seven;  thus  the  entire  four  were  used.  In  another 
school  one' teacher  volunteered  to  change  places  with  a  teacher 
who  already  had  been  selected.  No  principals  were  included  as 
teachers.  A  total  of  fifty-nine  teachers  were  chosen  as 
described  above  and  constituted  the  core  of  the  study. 

The  five  teachers  received  a  blue  questionnaire  to 
describe  and  rate  themselves.  They  completed  the  question¬ 
naires  during  their  spare  time  and  returned  them  to  the 

investigator  during  the  day. 

Whenever  the  word  "teacher"  or  the  words  "teachers 
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themselves”  are  used  they  refer  to  this  group  of  fifty-nine 
teachers  who  were  the  centre  of  the  study. 

The  Other  Teachers  on  the  Same  Staff 

At  the  same  time  as  the  above  five  teachers  were 
selected,  five  other  teachers  were  chosen  at  random  from  the 
staff.  This  peer  group  was  used  to  describe  and  rate  the 
original  five  teachers.  They  are  referred  to  as  the  ’’other 
teachers”  in  the  remainder  of  the  study. 

Each  other  teacher  received  five  green  questionnaires 
and  completed  them  individually  during  the  day.  They  com¬ 
pleted  one  questionnaire  on  each  of  the  five  teachers. 

The  number  of  other  teachers  varied  from  two  to  five. 
In  one  school  two  were  all  that  were  available.  In  two 
schools  three  were  used.  In  four  schools  four  other  teachers 
were  utilized  and  in  the  remaining  five  schools  five  other 
teachers  from  each  school  completed  the  questionnaire.  The 
average  was  over  four  other  teachers  per  teacher.  Question¬ 
naires  completed  by  the  50  other  teachers  totalled  241.  In 
all  but  one  school  the  other  teachers  were  from  senior  or 
junior  high  school. 

The  Students 


The  plan  was  to  chose  ten  students  to  describe  and 
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rate  each  of  the  five  teachers  in  the  school.  The  selection 
of  the  students  was  carried  out  with  the  aid  of  the  principal. 

For  each  teacher  an  alphabetical  list  was  obtained  of 
all  the  students  that  he  or  she  taught.  The  ten  students 
chosen  were  as  equally  distributed  as  possible  throughout  all 
the  grades  taught  by  the  teacher  while  at  the  same  time  giving 
some  weight  to  the  number  of  pupils  in  each  grade  or  class. 

For  example,  if  a  teacher  taught  grades  ten,  eleven,  and 
twelve  with  more  students  in  grade  ten,  four  students  were 
selected  from  grade  ten  and  three  from  each  of  the  other  two 
grades.  The  students  were  chosen  from  alphabetically  arranged 
lists  using  numbers  predetermined  by  the  investigator.  In  all 
cases  the  students  involved  were  taught  one  or  more  subjects 
by  the  teacher  they  described.  Some  students  had  been  taught 
by  the  specific  teacher  for  a  number  of  years  and  some  for 
just  the  present  year.  In  no  case  did  a  student  describe  more 
than  one  teacher;  once  chosen  for  a  specific  teacher,  they 
were  not  chosen  to  describe  another  teacher. 

There  was  one  exception  to  the  above  random  selection 
of  students.  In  one  class  the  teacher  had  a  disproportionate 
number  of  repeaters  compared  to  the  normal  class.  A  division 
was  made  between  the  repeaters  and  non-repeaters  and  the 
students  were  chosen  at  random  from  each  group.  The  result¬ 
ing  group  of  students  had  fewer  repeaters  than  if  the 


?«.*'•  X  liOjv;  $js: 


■  x  • ' :  *i’ ,  •  j  e  7  ,  , 


-X  '  •  •  1  ■  -  '  •  _  • ;  .  •  .  ■  7.,  V. 

x  -  •  ■  .  ■ 


*0  •-  •  •-  .  n; :J.  . ; ,, ...  .  7  ,  9.  ; , •, 

v  •  •  •  •  -  • .  '  .  . 

' 

uq  leami  ...  .  i  at 

,J  *  \  j.  >L  x  •  ;j  *.  i  .  1 .  ... 

•  '  ■  •  ,..x  •...  . 

■  ‘I  -  M  .  ,  •  .  .  .  ■  3  Vj. 

1  ■  *'  ■  -  •  •  I  ,l  :  . 

.  .  ..  .  ., 

1  0  -  -  •'  ■  \  •  ■  V  J.  •  ■  .  .J  i,..:  ,  0 

•  .  .  •.  ,  . 

.  "  '  '  '  1 

';j  f-'  ''  ^  *  -  •  -  •;  x.:  .  7.  w 

W  .  i.i)£Ki.  1  0-  1  'j  ■■ .  ['d  .  ;  '  X  di>  ’£  ...  ^  •  J  Ji  f 

'  '•  '  •  -  "  ......  JC 

■  •  •  •  <  ■  xi.  c  .  .  • 

' 

•  ^  ;  '  •>*'••  ‘  x  Dli*  X  ",  .  .  ...  J  •  j.  *A  -,i.  !A... 

-•'J-  '  •'  i  h:  \  .si  .d.-.:.  .  c.Jv  k  ; 

•  -  *  ■  '■  •  -  K>i>  |  *1 

x.  •  >‘1  ... ,.j jj  • 


50 


selection  nad  been  made  at  random  from  the  whole  class.  In 
a  few  Instances  the  principal  provided  an  additional  student 
when  one  of  those  previously  selected  was  not  available. 

The  smallest  number  of  students  used  was  7  per  teacher. 
The  total  number  of  students  was  571  which  averages  over  9*5 
students  per  teacher.  Of  this  total  222  were  from  grades 
eight  and  nine  and  the  remaining  349  were  from  senior  high 
school . 

The  students  gathered  in  a  room  and  the  white  question¬ 
naire  was  administered  separately  to  each  group  by  the 
investigator.  The  directions  (Appendix  A)  emphasized  that 
the  teachers  were  chosen  by  chance  and  that  the  results  were 
for  research.  Each  student  completed  the  questionnaire  on 
his  own  and  returned  it  to  the  investigator  before  leaving 
the  room. 

The  Principals 

During  the  day  of  the  visit  the  principal  completed 
one  yellow  questionnaire  on  each  of  the  five  teachers.  Thus, 
there  were  fifty-nine  questionnaires  completed  by  twelve 
principals . 

The  Superintendents 


The  superintendents,  all  provincially  appointed. 
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completed  one  pink  questionnaire  on  each  of  the  five  teachers 
from  the  one  school  in  their  area.  In  all  but  three  cases 
they  mailed  them  to  the  investigator.  There  were  fifty-nine 
questionnaires  completed  by  twelve  superintendents. 

II.  THE  SCHOOLS  AND  TEACHERS  INVOLVED 

The  hypotheses  of  this  study  involved  a  belief  that 
elements  of  teacher  leader  behavior  are  common  to  teachers 
anywhere  and  under  all  circumstances.  Thus  a  discussion  of 
the  breadth  of  coverage  is  considered  helpful  at  this  time. 

A  brief  glance  at  the  schools  and  teachers  involved  showed 
that  the  coverage  was  very  broad. 

City  schools  were  not  used.  Elementary  teachers  were 
omitted  because  it  would  not  be  possible  to  have  elementary 
students  answer  the  questionnaire. 

Geographically  the  schools  covered  many  parts  of  the 
province,  but  in  the  main  were  centred  around  Edmonton.  No 
two  schools  were  in  the  same  inspectorate.  All  schools,  but 
one,  were  in  divisions  and  counties.  No  separate  school 
districts  were  included. 

The  fifty-nine  teachers  studied  covered  a  wide  range 
of  subjects  taught,  experience,  training  and  background. 

There  were  twenty-one  females  and  thirty-eight  males.  Sub¬ 
jects  taught  included  all  academic  courses  plus  home 
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economics,  commercial,  physical  education,  art,  music,  and 
industrial  arts.  Some  of  the  teachers  were  travelling 
specialists.  They  taught  all  grades  from  eight  to  twelve. 
Some  taught  only  one  grade  while  others  taught  four  grades. 
They  ranged  from  one  to  seven  years  of  academic  and  pro¬ 
fessional  training  beyond  high  school.  In  total  experience 
the  range  was  from  two  to  thirty-five  years  and  the  experi¬ 
ence  in  their  present  school  ranged  from  one  to  seventeen 
years.  Some  of  the  teachers  were  recently  from  England, 

The  United  States,  Iceland,  and  Saskatchewan.  The  size  of 
the  schools  ranged  from  a  low  of  eleven  members  on  a  staff 
to  a  high  of  twenty-seven. 

III.  SUMMARY 

The  data  for  this  research  were  collected  from  12 
schools  in  Alberta  through  personal  visitation  by  the 
investigator.  It  involved  59  teachers  who  were  the  centre 
of  the  study  and  whose  behaviors  were  described  and 
effectiveness  rated  by  themselves,  by  571  students,  by  12 
principals,  by  12  superintendents,  and  by  50  other  teachers 
completing  241  questionnaires. 

Questionnaires  were  administered  to  1,004  persons. 
Fifteen  were  eliminated  because  they  were  only  partially 
completed  or  the  directions  were  not  followed.  The 
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remaining  989  questionnaires  were  utilized  in  the  study. 

The  teachers  involved  in  the  study  included  a  wide 
variety  of  subject  specialists *  both  academic  and  non- 
academic;  they  were  teachers  with  a  wide  range  of  years  of 
training*  total  experience*  and  experience  in  their  present 
school.  Some  were  recently  from  areas  outside  Alberta. 
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CHAPTER  V 


RESULTS ,  ANALYSIS  AND  DISCUSSION  OF  EFFECTIVENESS  RATINGS 

I.  RESULTS  AND  ANALYSIS 

Group  Averages  and  Deviations 

In  analysing  the  effectiveness  rating  scores  for  the 
fifty-nine  teachers  there  was  no  assumption  of  normal  distri¬ 
bution  of  scores  nor  of  equal  intervals  on  the  rating  scale. 
Thus,  in  most  cases  non-parametric  statistics  were  used. 

Table  I  summarizes  the  medians,  quartile  points,  and 
quartile  deviations  for  the  five  sets  of  raters  when  judging 
the  effectiveness  of  the  fifty-nine  teachers.  Since  a  rating 
of  ”1”  was  "outstanding  for  this  school"  and  a  rating  of  "6" 
was  "very  low  for  this  school"  (see  Appendix  A) ,  a  low  score 
indicated  a  higher  rating  and  a  high  score  indicated  a  lower 
rating. 

In  the  case  of  the  students  and  other  teachers,  the 
score  for  each  teacher  was  derived  from  the  median  of  the 
ratings.  In  the  case  of  the  other  three  sets  of  raters,  the 
principals,  the  superintendents,  and  the  teachers  themselves, 
single  ratings  provided  their  judgment  about  each  teacher's 
effectiveness.  Thus,  only  the  first  two  sets  of  ratings 
utilized  the  opinion  of  more  than  one  individual  per  teacher. 
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TABLE  I 

MEDIANS,  QUARTILE  POINTS  AND  QUARTILE  DEVIATIONS  OF  THE 
EFFECTIVENESS  RATINGS  FOR  FIFTY-NINE  TEACHERS 


Ratings  provided  by: 
Other 


Students 

Teachers 

Principals 

Sup  er int  endent  s 

Teachers 

Ql 

1.91 

1.87 

1.94 

2.09 

2.35 

0-2 

2.39 

2.38 

2.48 

2.78 

3.04 

«3 

2.99 

3.06 

3.39 

3.42 

3.43 

Q 

.54 

.60 

.72 

.  66 

.54 

NOTE:  If  the  teachers  were  average  for  the  school,  the 

median  should  be  3.50. 

Higher  ratings  are  indicated  by  smaller  scores 
and  lower  ratings  are  indicated  by  higher  scores.  See  global 
rating  scale  Appendix  A. 
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Whenever  the  effectiveness  ratings  by  students  or  other 
teachers  are  discussed,  the  ratings  refer  to  the  single, 
composite  rating  obtained  for  each  teacher  from  the  median 
of  the  ratings  by  the  ten  students  and  from  the  median  of 
the  ratings  by  the  five  other  teachers. 

The  highest  median  rating  in  Table  I  was  provided  by 
students  and  other  teachers,  followed  by  principals,  superin¬ 
tendents,  and  the  teachers  themselves.  All  five  sets  of 
raters  saw  the  average  of  the  fifty-nine  teachers  as  being 
above  the  average  of  the  scale  which  was  3.50.  Therefore, 
the  tendency  to  rate  high  found  in  other  studies  was  sup¬ 
ported.  Contrary  to  the  expectations  expressed  to  the 
investigator  by  many  teachers,  the  students  rated  the  teachers 
highly. 

The  principals  spread  their  ratings  out  over  a  wider 
range  than  did  the  other  raters  as  indicated  by  the  quartile 
deviations  (Q) .  The  fact  that  ten  students  and  five  other 
teachers  had  their  ratings  averaged  to  provide  a  single  rating 
would  tend  to  nullify  any  extremes  among  these  two  groups  of 
judges. 

For  the  students'  ratings  the  quartile  deviation  was 
calculated  for  each  teacher.  Of  the  59  deviations  so  calcu¬ 
lated  the  greatest  was  .91  and  the  lowest  was  .25  with  the 
mean  at  .55.  For  the  other  teachers'  ratings  the  quartile 
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deviations  were  calculated  for  54  of  the  59  teachers  and 
they  ranged  from  1.00  to  0  with  the  mean  at  .48.  There  was 
no  significant  difference  at  the  .05  level  between  the  mean 
of  the  deviations  for  the  students  and  the  mean  of  the  devi¬ 
ations  for  the  other  teachers. 

An  analysis  of  the  range  of  scores  was  provided  by 
the  frequency  with  which  the  extremes  of  the  rating  scale 
were  used.  Table  II  summarizes  this  information.  Students 
and  teachers  themselves  did  not  rate  any  teacher  in  the  two 
lowest  categories.  The  number  of  teachers  who  were  rated 
"1.  Outstanding  for  this  school"  were  few,  but  fairly  con¬ 
sistent  for  four  of  the  sets  of  raters. 

For  use  in  a  later  part  of  the  study  the  ten  students 
for  each  teacher  were  divided  into  two  groups  at  random. 

The  purpose  of  this  procedure  was  to  provide  independent 
judges  for  effectiveness  ratings  and  behavior  descriptions. 
Thus,  one  half-group  of  students  would  provide  effectiveness 
ratings  and  the  other  half-group  of  students  would  provide 
behavior  descriptions.  At  this  stage  of  the  analysis  only 
the  half-group  which  provided  the  effectiveness  ratings  was 
studied.  The  median  of  the  effectiveness  ratings  of  this 
half-group  of  students  was  2.3^  which  was  practically 
identical  to  the  median  obtained  from  the  whole  group  of 
students  (2.39).  The  quartile  deviation  was  .60  which  was 
slightly  higher  than  that  reported  for  the  whole  group  (.54). 
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Table  III  indicated  the  rank-order  correlations 
between  effectiveness  ratings  provided  by  the  sets  of  raters. 
There  was  a  large  number  of  ties  in  the  rank-orders,  thus, 
the  formula  given  by  Siegel  (l,  p.  203)  for  tied  ranks  was 
used. 

There  was  a  consistent  lack  of  relationship  between 
self-ratings  and  effectiveness  ratings  by  the  other  sets  of 
judges. 

The  superintendents  and  principals  showed  the  most 
agreement  and  the  superintendents  and  students  showed  the 
least  agreement  (excluding  the  self-ratings).  To  test  the 
significance  of  the  difference  between  these  two  correlations 
parametric  statistics  had  to  be  used.  Since  N  was  59  the 
frequency  was  not  too  small  to  use  parametric  statistics,  but 
the  procedure  can  be  criticized  on  the  bases  that  the  inter¬ 
vals  of  the  scale  were  not  equal  and  the  scores  might  not  be 
normally  distributed.  The  z*  transformation  was  applied  and 
the  difference  between  the  correlations  of  .68  and  .  4l  was 
found  to  be  significant  at  the  .05  level.  Thus,  one  may  con¬ 
clude,  keeping  in  mind  the  above  criticism,  that  there  was 
significantly  more  agreement  between  superintendents  and 
principals  than  there  was  between  the  superintendents  and 
students  on  the  effectiveness  ratings.  Excluding  self-ratings. 
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TABLE  III 

RANK- ORDER  CORRELATIONS  BETWEEN  RATERS 


Students 

Other 

teachers 

Principals 

Superintendents 

Other  teachers 

.56** 

Principals 

,43** 

.53** 

Sup  er int  endent  s 

.41** 

.52** 

.68** 

Teachers 

-.02 

.02 

-.01 

.05 

**  significant  at  the  .01  level. 
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none  of  the  other  differences  between  correlations  were  sig¬ 
nificant.  Except  for  the  highest  and  lowest  correlations, 
which  were  different,  the  correlations  and  therefore  the 
agreement  between  raters  appeared  to  be  similar. 

A  further  indication  of  consensus  was  derived  by 
calculating  Kendall's  Coefficient  of  Concordance  (W)  (l,  p. 
229) .  This  coefficient  provided  the  amount  of  consensus  in 
the  rank-order  of  the  effectiveness  ratings  by  more  than  two 
judges.  It  did  not  indicate  the  degree  of  correctness  of  the 
rankings,  as  all  judges  could  agree  completely,  yet  be  wrong. 
It  is,  however,  a  useful  statistic  when  there  are  numerous 
judges  and  when  there  is  no  external  criterion  with  which  to 
check  the  validity  of  the  ratings.  The  range  of  possible 
values  for  W  is  from  zero  to  plus  one.  Thus,  if  W  was  equal 
to  zero,  it  would  indicate  no  agreement,  while  if  W  was  plus 
one,  it  would  indicate  perfect  agreement.  An  indication  of 
the  meaning  of  the  W  coefficients  was  obtained  by  translating 
them  to  an  equivalent  average,  rank-order  correlation  (l, 
p.  229).  This  is  the  average  of  all  the  possible  corre¬ 
lations  between  the  judges  included  for  any  one  W  coefficient. 

Table  IV  summarizes  the  W  coefficients  for  three 
combinations  of  judges.  The  first  combination  included  those 
judges  from  within  the  school  other  than  the  teacher  himself; 
the  second  consisted  of  the  superintendent  in  addition  to  the 
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first  three,  and  the  last  combination  included  all  five  sets 
of  judges  at  once.  As  an  additional  judge  was  added  the 
amount  of  consensus  decreased  although  the  equivalent  average, 
rank-order  correlation  was  highest  for  the  second  combination. 
All  W  coefficients  were  significant  at  the  .01  level,  but  they 
did  not  indicate  a  great  deal  of  consensus. 


TABLE  IV 

CONSENSUS  AMONG  JUDGES  ABOUT  EFFECTIVENESS  RATINGS 


Judges 

W* 

Equivalent 

rank-order 

average, 

correlation 

(l)  Students,  Other  teachers, 
and  Principals 

.64 

.42 

(2)  Students,  Other  teachers. 
Principals,  and  Superintendents 

.61 

00 

-=t 

• 

(3)  Students,  Other  teachers. 
Principals,  Superintendents, 
and  Teachers  themselves 

.43 
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*  All  W  coefficients  were  significant  at  the  .01  level. 

The  amount  of  agreement  at  the  extremes  of  the  rating 
scale  was  analyzed.  For  a  rating  of  ”1"  there  was  no  teacher 
so  rated  by  all  five  sets  of  judges.  One  teacher  appeared  in 
this  category  for  four  sets  of  raters.  No  teacher  was  rated 
"l”  by  three  sets  of  raters  and  only  one  was  so  rated  by  two 
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sets  of  raters.  Six  of  the  teachers  were  rated  in  category 
l"  by  only  one  set  of  raters.  Thus,  out  of  eight  teachers 
who  were  rated  in  category  "l"  of  the  rating  scale,  only  two 
were  placed  in  this  category  by  more  than  one  set  of  judges. 
Only  one  teacher  was  rated  ”6”  and  that  was  by  a  principal. 
There  was  not  a  teacher  placed  in  category  "5"  by  all  five 
sets  of  raters,  nor  by  four  or  three  sets  of  raters.  Two 
teachers  were  rated  n5"  by  two  sets  of  raters  and  seven  were 
placed  in  this  category  by  only  one  set  of  raters.  Thus, 
out  of  nine  teachers  who  were  rated  in  category  ,I5U,  only 
two  were  placed  in  this  category  by  more  than  one  set  of 
judges.  When  compared  to  the  amount  of  agreement  possible, 
it  can  be  said  that  there  was  lack  of  agreement  about  rating 
of  teachers  at  the  extremes  of  the  scale. 

Variations  by  Schools  from  the  Average  for  the  Scale 

Originally,  the  investigator  considered  making  an 
adjustment  to  the  effectiveness  ratings  to  take  into  account 
the  variation  of  the  average  score,  of  the  five  teachers  on 
each  staff,  from  the  average  score  of  the  scale.  Since  the 
scale  was  so  worded  that  the  judges  were  to  use  as  their 
standard  the  average  teacher  of  each  school,  this  might  have 
been  a  logical  adjustment.  If  all  the  teachers  on  a  staff 
were  rated,  the  median  rating  should  be  close  to  3*50  which 
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was  the  average  of  the  scale.  Due  to  the  fact  that  only  five 
teachers  in  any  one  school  were  used  and  those  selected  were 
from  above  grade  seven,  the  assumption  that  the  average  rat¬ 
ing  in  each  school  should  be  the  average  of  the  scale,  was 
dropped.  It  was  considered  plausible  that  the  average  rating 
of  the  five  teachers  selected  might  vary  from  3.50.  An 
investigation  showed  that  for  effectiveness  ratings  provided 
by  students  the  medians  were  nearly  all  around  one  unit  above 
the  centre  of  the  scale.  The  consistency  of  the  average  rat¬ 
ing  of  the  five  teachers  from  school  to  school  was  very 
noticeable.  Some  of  the  previous  rank-order  correlations 
were  calculated  with  the  use  of  adjusted  ratings  and  there 
did  not  appear  to  be  any  significant  difference  from  the 
correlations  using  the  raw  ratings. 

II.  DISCUSSION 

The  results  indicated  that  the  median  effectiveness 
ratings  of  the  fifty-nine  teachers  were  above  the  average 
for  the  scale.  The  teachers  rated  themselves  as  being  lower 
in  average  effectiveness  than  did  any  of  the  other  judges. 

There  was  some  agreement  among  the  judges  as  to  the 
rank-order  of  the  effectiveness  ratings.  Since  only  the 
lowest  and  highest  rank-order  correlations  were  significantly 
different,  there  was  some  similarity  in  the  amount  of  agree- 


Co  9J  .  ;  .  :  .  •  EBW 

:  .  -  .  . 

i  ,  i  •  .  • 

=  .  '  ■;  n  .  :  -j.  •..■  .»  «  bf.u  -i...  Icc.icr/,  nor.  9  nJ.  vnl 

V  :  •  ■  .  .  ;  ••  r.  \J  i.;  o  •  /  ,  ...  .  , 


•-  .  .  :y  vi  .  ,  <j.  .  1>  ■  3  .  .■  ' ...  ' . 

.  \  ■  .  c  a  .  '  >i  ' .  2  1  ..  :  .  I 

•  ■  .  ... 

• .. ■  v  •...  ,-w  .  .  .  i  ..  Oi  >r . 

.  -  moil  • .  os  .  , 

■  . 

■  v  ;  ,  ■  *jl  -  i.  ■  .  ■  . o  j.  ...  •jLiy.'  o‘i  .>.: 

--'.Lx  ...  no.'  7.  .  .  ..  j  ...  i  ;  :  Jc.t  jjJ  .0 

.  -  ;ri  r.;  .  :  3^..'  '  .  J.  ei;..  ;  o 


i  •  ...  >  .  C,A.c  Cm  ii  .  ..  Li  J  £  JJ  G 

o  •  ...  ..... 

■  •  ,  .  ..  .  ■■.  . 

' . :  .  •  . •  •  9*1  ..  .. 

■  '  .  .  .  .  •  .  ■ 

'  ••  •  ••  '  '  .  .  -J ...  .  '■  v.'  ; .  ■  A !•'...=  v  .  ./. 

.. .  .  .  ill  '  .  ..  .  .  •„  2 


65 

ment  between  pairs  of  judges.  The  total  consensus  although 
significant  in  all  cases  was  not  considered  very  great. 

There  appeared  to  be  just  about  as  much  consensus  with  the 
superintendent  added  to  the  other  three  judges  (excluding 
the  teachers  themselves)  as  there  was  with  just  these  three 

(students,  other  teachers,  and  principals).  At  the  extremes 

* 

of  the  rating  scale  there  was  lack  of  agreement  compared  to 
complete  agreement. 

The  greatest  amount  of  disagreement  was  between  the 
teachers’  effectiveness  ratings  of  themselves  and  the 
effectiveness  ratings  by  the  remaining  sets  of  judges.  Also, 
as  noted  above,  the  teachers  judged  themselves  as  being 
lower  in  average  effectiveness  than  did  any  of  the  other 
judges.  The  investigator  felt  that  motivation  rather  than 
frame  of  reference  or  ability  to  judge  was  the  important 
factor.  Many  teachers  tended  to  be  modest.  This  was  evident 
not  only  in  the  results,  but  also  in  their  comments  to  the 
investigator.  Some  stated  that  the  scale  was  very  difficult 
to  use  since  it  did  not  include  a  category  "average  for  this 
school."  This  was  the  rating  that  many  teachers  indicated 
they  would  prefer  to  give  themselves.  There  is  also  a  possi¬ 
bility  that  competent  teachers  more  readily  realize  their 
inadequacies  and  tend  to  rate  themselves  lower  than  external 
judges  would  rate  them. 
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The  literature  in  Chapter  II  regarding  judging  seemed 
to  indicate  that  superintendents  would  be  the  most  competent 
judges  due  to  their  broader  experience  allowing  them  to 
develop  more  appropriate  norms,  and  due  to  more  practice  in 
actual  evaluation  which  might  develop  their  ability  to  judge. 
It  was  not  possible  to  verify  the  correctness  of  their  judg¬ 
ment,  but  they  judged  the  teachers  as  a  whole  as  being 
closer  to  the  mid-point  of  the  scale  than  did  any  other 
external  judge.  Also,  they  agreed  to  a  greater  extent  with 
the  principals  than  they  did  with  any  other  group. 

The  highest  relationship,  between  the  superintendents’ 
and  the  principals’  effectiveness  ratings,  was  significantly 
different  from  the  lowest  relationship,  between  the  super¬ 
intendents’  and  the  students'  effectiveness  ratings.  The 
greatest  amount  of  agreement  between  superintendents  and 
principals  might  be  explained  by  the  fact  that  they  were  in 
the  most  similar  position  with  regard  to  experience  in 
evaluating  teachers  and  supervisory  contact  with  teachers. 

The  superintendents  and  students  would  be  in  the  most  dis¬ 
similar  position  since  the  literature  indicated  that  students 
would  have  the  least  ability  to  judge  other  individuals,  and 
would  have  the  least  opportunity  to  develop  a  frame  of  refer¬ 
ence  for  judging.  Also,  the  superintendents  had  the  least 
contact  with  the  teachers  in  the  classroom  while  the  students 
had  the  most  contact  with  the  teachers  in  the  classroom. 
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CHAPTER  VI 


RESULTS,  ANALYSIS  AND  DISCUSSION  OF  DESCRIPTIONS 
OF  TEACHER  LEADER  BEHAVIOR 

I.  RESULTS  AND  ANALYSIS 
Group  Averages  and  Deviations 

The  scores  on  the  TLBDQ  were  calculated  separately  for 
the  dimensions  of  Consideration  and  Initiating  Structure.  A 
relatively  high  score  on  either  dimension  indicated  that  the 
teacher  behaved  in  a  certain  manner  more  often  than  a  teacher 
did  with  a  relatively  low  score.  The  averages  for  the  entire 
group  of  teachers  were  derived  by  calculating  the  medians. 

Table  V  summarizes  the  medians,  quartile  points,  and 
quartile  deviations  for  the  five  sets  of  judges  when  describ¬ 
ing  the  behavior  of  the  fifty-nine  teachers. 

In  the  case  of  the  students  and  other  teachers,  means 
were  used  to  give  an  average  measure  per  teacher.  This  was 
the  procedure  used  with  the  LBDQ,  and  also,  the  investigator 
believed  that  extremes  in  behavior  scores  should  be  given  full 
weight  in  computing  the  average  description  for  each  teacher. 
In  the  case  of  the  other  three  sets  of  judges,  the  principals, 
the  superintendents,  and  the  teachers  themselves,  single 
scores  on  each  dimension  provided  their  judgment  about  each 
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MEDIANS,  QUARTILE  POINTS  AND  QUARTILE  DEVIATIONS  OF  THE 
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teacher’s  behavior.  Whenever  the  behavior  descriptions  by 
students  or  other  teachers  are  discussed,  they  refer  to  the 
single,  composite  score  obtained  for  each  teacher  from  the 
mean  of  the  scores  by  the  ten  students  and  from  the  mean  of 
the  scores  by  the  five  other  teachers. 

As  shown  in  Table  V  all  the  medians  for  Initiating 
Structure  were  higher  than  those  for  Consideration  except 
for  teachers’  descriptions  of  themselves.  This  was  opposite 
to  the  result  found  by  Halpin,  with  non-teachers,  when  the 
medians  for  superintendents  and  a  sample  of  aircraft  com¬ 
manders  were  higher  on  the  Consideration  dimension  (l,  p. 

10 ) .  There  was  slightly  more  agreement  on  the  medians  for 
Consideration  than  on  the  medians  for  Initiating  Structure. 

The  highest  Initiating  Structure  median  was  provided 
by  other  teachers  and  the  lowest  by  students.  The  highest 
Consideration  median  was  provided  by  teachers'  descriptions 
of  themselves  and  the  lowest  was  provided  by  students. 

The  quartile  deviations  (Q)  of  the  descriptions  of  the 
fifty-nine  teachers  were  lowest  for  the  students  and  other 
teachers,  while  the  superintendents  showed  the  greatest  devi¬ 
ations  in  their  descriptions  of  the  fifty-nine  teachers. 

The  variability  of  the  descriptions  provided  by  the 
ten  students  and  five  other  teachers  was  calculated  for  each 
of  the  fifty-nine  teachers  by  obtaining  the  standard 
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deviation.  The  means  of  the  standard  deviations  on  Initi¬ 
ating  Structure  were  6.24  for  students  and  6.03  for  other 
teachers.  These  were  not  significantly  different  at  the  .05 
level.  The  means  of  the  standard  deviations  on  Consideration 
were  6.8l  for  students  and  6.09  for  other  teachers  with  the 
difference  not  being  significant  at  the  .05  level.  There  was 
no  significant  difference  between  the  mean  standard  deviation 
on  Consideration  and  the  mean  standard  deviation  on  Initi¬ 
ating  Structure  for  students  nor  for  other  teachers. 

As  was  reported  in  Chapter  V  (see  page  57)  the  ten 
students  for  each  teacher  were  divided  into  two  groups  at 
random.  One  group  was  used  to  provide  effectiveness  ratings 
and  the  other  group  was  used  to  provide  behavior  descriptions. 
An  analysis  of  the  half-group  of  students  used  to  provide 
descriptions  of  behavior  resulted  in  an  Initiating  Structure 
median  of  41.2  and  Q  of  3.7*  while  the  Consideration  median 
was  39.8  and  Q  was  3.8.  This  indicated  that  the  medians  of 
the  descriptions  provided  by  one  of  the  half-group  of  students 
were  very  similar  to  those  provided  by  the  whole  group  of 
students  (Table  V) .  The  quartile  deviations  were  the  same 
for  the  half-group  of  students  and  the  whole  group  of  students 
on  Consideration,  but  on  Initiating  Structure  the  half-group 
had  a  larger  deviation  than  did  the  whole  group. 
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Independence  of  the  Dimensions 

The  background  research  on  the  LBDQ  indicated  that, 
although  the  two  dimensions  were  related.,  there  was  reason 
to  believe  they  provided  scores  on  dimensions  which  were 
somewhat  independent.  The  independence  of  the  two  dimensions 
on  the  TLBDQ,  was  checked  by  calculating  the  product -moment 
correlations  between  the  dimensions  for  each  of  the  five 
sets  of  raters  and  the  results  are  shown  in  Table  VI. 


TABLE  VI 

CORRELATIONS  BETWEEN  INITIATING  STRUCTURE  AND 
CONSIDERATION  SCORES  FOR  EACH  SET  OF  JUDGES 


Judges 

Correlations* 

Students 

.58 

Other  teachers 

.52 

Principals 

00 

• 

Superintendents 

.61 

Teachers 

ii 

it 

ii 

ii 

00  || 
on 

• 

II 

H 

II 

ii 

II 

II 

II 

II 

II 

H 

H 

II 

II 

H 

*  All  correlations  were  significant 
at  the  .01  level. 

The  greatest  amount  of  independence  was  found  between 
the  dimensions  for  the  behavior  descriptions  by  the  teachers 
themselves  and  the  least  amount  of  independence  for  the 
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behavior  descriptions  by  the  superintendents.  The  results 
reported  in  this  study  were  not  greatly  different  from  the 
various  correlations  reported  and  accepted  for  the  LBDQ,  but 
the  investigator  does  not  consider  them  indicative  of  a  high 
degree  of  independence. 

As  a  further  analysis  of  independence  two  other  cor¬ 
relations  were  calculated.  Both  correlations  were  obtained 
from  the  data  supplied  by  the  half-groups  of  students.  The 
first  was  the  correlation  between  the  dimensions  for  the  half¬ 
group  of  students  that  were  used  to  provide  behavior  descrip¬ 
tions  (see  page  71).  The  correlation  was  .52  as  contrasted 
with  .58  reported  for  the  whole  group  of  students  in  Table  VI. 
Thus,  there  was  about  the  same  amount  of  independence  between 
the  dimensions  when  the  half-group  described  the  teachers  as 
when  the  whole  group  described  them.  The  second  correlation 
was  determined  by  using  the  Initiating  Structure  scores  from 
one  half-group  and  the  Consideration  scores  from  the  other 
half-group.  In  this  instance  the  describers  were  all  students, 
but  the  scores  on  one  dimension  were  provided  by  one  group 
while  the  scores  on  the  other  dimension  were  provided  by  the 
other  group.  The  resulting  correlation  was  .38.  Although 
this  was  lower  than  either  of  the  other  correlations  reported 
above,  the  differences  were  not  statistically  significant  at 
the  .05  level.  The  difference  was  tested  by  making  the  z‘ 
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transformation.  This  procedure  was  used  throughout  to  test 
the  significance  of  the  difference  of  two  correlations. 

Consensus 


The  product-moment  correlations  between  the  descrip¬ 
tions  by  the  various  judges  are  recorded  in  Tables  VII  and 
VIII  and  they  indicate  the  amount  of  agreement  between  judges. 
There  was  some  agreement  between  judges  on  their  descriptions 
of  both  dimensions  of  a  teacher’s  leader  behavior  with 
students.  As  in  the  results  of  the  effectiveness  ratings 
discussed  in  Chapter  V,  there  was  little  or  no  agreement 
between  behavior  descriptions  by  the  teachers  themselves  and 
behavior  descriptions  by  other  judges.  The  one  noticeable 
difference  between  these  results  and  the  correlations 
recorded  for  the  effectiveness  ratings  was  the  significant 
correlations  obtained  between  the  students’  descriptions  and 
the  teachers’  descriptions  of  themselves  on  both  dimensions. 

On  both  dimensions  the  greatest  amount  of  agreement 
was  between  the  principals  and  the  other  teachers,  while  the 
least  amount  of  agreement  (excluding  self-descriptions)  was 
between  the  superintendents  and  the  students  on  Initiating 
Structure  and  between  the  superintendents  and  the  other 
teachers  on  Consideration. 

For  Initiating  Structure  the  difference  between  the 
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TABLE  VII 

CORRELATIONS  BETWEEN  JUDGES  ON  THE  INITIATING  STRUCTURE  SCORES 


Students 

Other 

teachers 

Principals 

Superintendents 

Other  teachers 

.51** 

Principals 

.38** 

.63** 

Superintendents 

.29* 

.4o** 

.48** 

Teachers 

.39** 

.23 

.14 

.17 

*  significant  at 

the  .05  level. 

**  significant  at 

the  .01  level. 

TABLE  VIII 

CORRELATIONS 

BETWEEN 

JUDGES  ON 

THE 

CONSIDERATION  SCORES 

Other 

Students 

teachers 

Principals 

Superintendents 

Other  teachers 

.55** 

Principals 

.44** 

.60** 

Superintendents 

.52** 

.38** 

.43** 

Teachers 

.42** 

.23 

.22 

.09 

** 


significant  at  the  .01  level. 
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correlation  showing  the  most  agreement  (principals  and.  other 
teachers)  and  that  showing  the  least  agreement  (superintendents 
and  students)  was  significant  at  the  .05  level.  Excluding 
correlations  with  self-descriptions.,  there  were  no  significant 
differences  between  other  pairs  of  correlations.  One  would 
conclude  that  the  amount  of  agreement  between  pairs  of  judges 
was  similar. 

For  Consideration  the  difference  between  the  correlation 
showing  the  most  agreement  (principals  and  other  teachers)  and 
that  showing  the  least  agreement  (superintendents  and  other 
teachers)  was  not  significant  at  the  .05  level.  Since  the 
correlations  between  all  pairs  of  judges  (excluding  self¬ 
descriptions)  were  not  significantly  different  from  each  other, 
one  would  conclude  that  the  amount  of  agreement  between  pairs 
of  judges  was  similar. 

The  similarity  between  correlations  of  Initiating 
Structure  and  the  correlations  of  Consideration  was  noticeable 
and  could  be  observed  by  comparing  TaMes  VII  and  VIII.  There 
was  one  exception  to  this  apparent  similarity  and  that  was  the 
correlations  between  superintendents’  scores  and  students’ 
scores.  The  correlation  of  Consideration  scores  appeared  to  be 
much  higher  than  that  for  Initiating  Structure  scores,  but  the 
difference  was  not  significant  at  the  .05  level.  All  correla¬ 
tions  with  scores  provided  by  students  were  higher  for 
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Consideration  than  they  were  for  Initiating  Structure,  but 
the  differences  were  not  significant. 

On  both  dimensions  the  students  exhibited  some  agree¬ 
ment  with  all  other  judges  including  the  teachers  themselves. 
All  of  the  eight  correlations  involving  the  students  were 
significant  at  the  .01  level  except  that  between  the  super¬ 
intendents  and  the  students  for  Initiating  Structure,  which 
was  significant  at  the  .05  level.  This  agreement  with  all 
other  judges  was  exhibited  only  by  students. 

The  Concordance  Coefficient  (W)  was  calculated  for  the 
rank-order  of  the  scores  on  both  dimensions  of  behavior. 

Table  IX  summarizes  these  results.  An  indication  of  the 
meaning  of  the  W  coefficients  was  obtained  by  translating 
them  to  equivalent,  rank-order  correlations  (3*  P.  229). 

All  coefficients  were  significant  at  the  .01  level, 
but  did  not  indicate  a  great  deal  of  consensus  on  what  were 
believed  to  be  easily  observed  behavior  items.  There  was  a 
slight  tendency  for  more  agreement  on  the  Consideration 
dimension  than  on  the  Initiating  Structure  dimension.  The 
consensus  decreased  with  additional  judges.  This  was  also 
observed  with  the  effectiveness  ratings  in  Chapter  V. 
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TABLE  IX 

CONSENSUS  AMONG  JUDGES  ABOUT  BEHAVIOR  DESCRIPTIONS 


Judges  Behavior  dimension 

Initiating  Consideration 
Structure 

W*  W* 


(l)  Students,  Other  teachers, 
and  Principals 

.  60 

(.40) 

.69 

(.51) 

(2)  Students,  Other  teachers. 
Principals,  and  Superintendents 

.52 

(.36) 

.61 

(.48) 

(3)  Students,  Other  teachers. 
Principals,  Superintendents, 
and  Teachers  themselves 

.45 

(.31) 

00 

• 

(.35) 

*  All  W  coefficients  were  significant  at  the 
.01  level. 

NOTE:  The  equivalent  average,  rank-order  corre¬ 

lations  are  in  brackets. 
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II.  DISCUSSION 

The  results  indicated  that  all  judges  except  the 
teachers  themselves  believed  that  the  teachers  on  the  average 
exhibited  more  Initiating  Structure  behavior  than  Consider¬ 
ation  behavior.  The  quartile  deviations  showed  that  the  super¬ 
intendents  deviated  more  on  their  descriptions  of  the  fifty- 
nine  teachers  on  both  dimensions  than  did  any  other  judges. 

In  the  mean  of  the  variabilities  of  the  descriptions  provided 
by  the  ten  students  for  each  teacher  there  was  no  significant 
difference  between  the  dimensions.  The  same  result  was 
observed  for  other  teachers.  There  was  as  much  mean  variation 
on  Consideration  as  'there  was  on  Initiating  Structure.  Also, 
there  was  no  significant  difference  between  the  means  of  the 
variabilities  of  the  ten  students’  descriptions  and  the  means 
of  the  five  other  teachers’  descriptions  on  both  dimensions. 

As  with  the  effectiveness  ratings,  the  results  have 
shown  that  there  was  some  agreement  between  most  pairs  of 
judges  on  the  scores  of  both  leader  behavior  dimensions.  The 
results  were  not  high  enough  to  indicate  that  the  TLBDQ  could 
be  used  by  one  or  two  sets  of  respondents  with  the  assumption 
that  the  behavior  descriptions  would  be  all-inclusive.  The 
results  indicated  that  these  five  sets  of  raters  used  varying 
frames  and  points  of  reference  in  their  perception  of  the 
leader  behavior  of  the  teacher. 
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The  amount  of  general  consensus  decreased  as  more 
judges  were  included.  There  was  slightly  more  consensus  on 
Consideration  than  on  Initiating  Structure.  The  amount  of 
consensus  on  the  effectiveness  ratings  did  not  vary  greatly 
from  the  amount  of  consensus  on  the  behavior  dimensions. 
Thus,  it  was  just  as  difficult  to  agree  on  the  teacher’s 
leader  behavior  in  the  classroom  as  it  was  to  agree  on  the 
teachers’  effectiveness. 

The  students  agreed  to  some  extent  with  all  of  the 
other  four  sets  of  judges  on  both  dimensions.  Students  may 
be  in  a  position  to  have  developed  a  frame  of  reference 
which  includes  parts  of  the  frames  of  reference  of  all  the 
other  judges.  This  could  be  expected  since  the  students 
would  have  experienced  the  teacher's  classroom  behavior  to 
a  much  greater  extent  than  any  of  the  other  judges. 

The  highest  and  lowest  correlations  for  Initiating 
Structure  were  significantly  different  and  indicated  that 
principals  and  other  teachers  exhibited  the  most  agreement 
while  students  and  superintendents  exhibited  the  least 
(excluding  the  teachers  themselves).  When  everyday  behavior 
of  the  teacher  in  the  classroom  is  being  described,  those 
who  were  the  closest  in  similarity  of  contact  with  the 
teacher  were  the  principal  and  other  teachers,  since  most 
of  the  principals  in  this  study  spent  the  majority  of  their 
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time  teaching.  The  superintendents  and  the  students  were  in 
the  most  dissimilar  position  relative  to  describing  the 
behavior  of  the  teacher  and  on  the  Initiating  Structure 
dimension  they  showed  the  least  agreement. 

The  highest  and  lowest  correlations  for  Consideration 
were  not  significantly  different,  when  the  correlations  with 
self-descriptions  were  excluded.  The  amount  of  agreement 
between  pairs  of  judges  was  very  similar. 

The  fact  that  there  was  no  significant  difference 
between  the  highest  and  lowest  correlation  (excluding  self- 
descriptions)  on  Consideration,  that  there  tended  to  be  more 
agreement  on  the  Consideration  dimension  than  on  the  Initiat¬ 
ing  Structure  dimension,  and  that  two  judges,  who  were  remote 
from  each  other,  (superintendents  and  students)  exhibited  a 
relatively  higher  agreement,  might  be  partly  explained  by 
the  nature  of  the  Consideration  dimension.  The  human 
relations  behavior  of  the  teacher  in  the  classroom  might  be 
much  the  same  as  his  relations  with  other  individuals  in 
general.  Thus,  superintendents,  although  remote,  might  more 
easily  infer  from  their  contact  with  the  teacher  outside  the 
classroom,  what  human  relations  behaviors  the  teacher  would 
exhibit  in  the  classroom.  This  does  not  appear  to  be  as 
evident  for  Initiating  Structure.  The  teacher’s  means  of 
developing  particular  structure  between  himself  and  the 
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members  of  the  class  to  attain  goals  and  to  provide  for  a 
profitable  learning  situation  might  not  be  so  easily  inferred 
especially  by  a  person  as  remote  as  the  superintendent.  It 
should  be  noted  that  many  of  the  local,  provincially  appointed, 
superintendents  do  not  have  much  classroom  contact  with  senior 
high  school  teachers. 

The  position  of  the  other  teachers  in  this  study  deserves 
discussion.  The  relatively  higher  agreement  that  was  exhibited 
between  them  and  principals  on  both  dimensions  than  between  any 
other  pair  of  judges  was  unexpected.  One  possible  reason  for 
this  has  been  explained  above.  They  and  the  principal  are  prob¬ 
ably  in  the  most  similar  position  relative  to  the  teacher  when 
it  comes  to  describing  behavior.  Yet  the  other  teachers, 
rarely,  if  ever,  observe  another  teacher  in  the  classroom.  In 
some  large  staffs  they  have  very  little  association  with  some 
members  of  the  staff.  Other  teachers  consistently  expressed 
the  opinion  to  the  investigator  that  their  descriptions  of 
the  teacher’s  behavior  with  pupils  would  be  invalid.  They 
believed  that  they  were  only  guessing.  Principals  never 
expressed  this  opinion.  Many  other  teachers  stated  that  their 
descriptions  would  not  agree  with  those  descriptions  by  otner 
judges  and  it  was  unnecessary  to  prove  this  true.  With  this 
belief  the  investigator  tended  to  agree,  but  it  was  not  sub¬ 
stantiated  by  the  results.  To  the  contrary,  the  correlations 
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between  the  principals  and  the  other  teachers  proved  the 
highest  on  both  dimensions.  There  are  two  possible  reasons 
for  this.  Firstly ,  the  principals  might  not  have  observed 
the  teacher’s  classroom  behavior  any  more  than  the  other 
teachers.  Very  few  principals  make  a  practice  or  have  the 
time  to  visit  the  classroom.  Secondly,  and  this  is  supported 
by  the  correlations  between  students  and  other  teachers,  class¬ 
room  leader  behavior  of  the  teacher  may  be  inferred  from 
general  contact  with  the  teacher.  The  type  of  behavior  meas¬ 
ured  by  the  TLBDQ  may  be  similar  to  the  behavior  that  a 
teacher  exhibits  in  his  association  with  other  people  involved 
in  the  same  type  of  work.  This  latter  explanation  has  been 
discussed  previously.  It  was  believed  that  Consideration 
might  be  more  easily  inferred  from  general  contact  with  the 
teacher  than  would  Initiating  Structure. 

It  was  concluded  that  there  was  a  considerable  amount 
of  disagreement  between  self-descriptions  and  those  provided 
by  the  other  judges.  A  similar  disagreement  was  reported  by 
Hemphill  and  Coons  (2,  p.  19).  There  was  a  significant  cor¬ 
relation  between  students’  descriptions  of  the  teacher  self¬ 
descriptions.  Since  there  was  no  significant  correlation 
between  effectiveness  ratings  by  other  judges  and  self-ratings 
as  reported  in  Chapter  V,  it  might  be  concluded  that  there  was 
less  of  the  motivational  factor  affecting  their  descriptions 
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of  themselves  than  there  was  affecting  their  effectiveness 
ratings  of  themselves.  Teachers  might  he  more  objective 
when  describing  themselves  than  when  rating  their  effective¬ 
ness. 


■  \  J:  ;*  '  *  V  -■  ,  .  \  t.  .  . 

.  .  ■  .  .  8  . 

••  .  .  ’  ■  ...  \ .  8j  sol  J  .  :d  no ,..  net  1  r 

.  '  i 3ii 


85 


REFERENCES 


(1)  Halpin,  A.  W.  Manual  for  the  Leader  Behavior  Description 
Questionnaire.  Columbus:  Bureau  of  Business  Research, 

The  Ohio  State  University,  1957. 

(2)  Hemphill,  J.  K. ,  and  A.  E.  Coons.  ’’Development  of  the 
Leader  Behavior  Description  Questionnaire,1'  Leader 
Behavior:  Its  Description  and  Measurement,  R.  M.  Stogdill 
and  A.  W.  Coons,  editors.  Pp.  6-38.  Research  Monograph 
Number  88.  Columbus,  Ohio:  Bureau  of  Business  Research, 
The  Ohio  State  University,  1957. 

(3)  Siegel,  S.  Nonparametrlc  Statistics  for  the  Behavioral 
Sciences.  Toronto:  McGraw-Hill  Book  Co.,  195^~T” 


'  i  ■  ' 


*  •  •  •  .  ) 

*  •  •  .  . 

.  ••  •  .  ) 

.  ..  >  '  6  • 

...  .  ; ,  _  _  _  .  -  _-V  . 

T 

.  ;  .  t  ...  .  C  -  j  £  . .. 


j  v  :  •  *  £  ...  • 


. 1  ■:  ■ 


v  . ) 


. . .  .  :■  •  -  ; . 


CHAPTER  VII 


THE  RELATIONSHIP  BETWEEN  EFFECTIVENESS  RATINGS 
AND  DESCRIPTIONS  OF  LEADER  BEHAVIOR 

I.  RESULTS  AND  ANALYSIS 
High  and  Low  Effectiveness  Groups 

The  most  important  analysis  of  the  data  involved 
selecting  the  most  effective  and  most  ineffective  teachers 
and  comparing  the  behavior  scores  of  these  two  groups.  This 
was  done  to  test  the  first  hypothesis  which  stated  that. 
Students  and  superiors  (principals  and  superintendents)  rate 
those  teachers  highest  in  teaching  effectiveness  who  score 
above  the  median  on  both  of  the  leader  behavior  dimensions. 
Consideration  and  Initiating  Structure.  Students  and  super¬ 
iors  rate  those  teachers  lowest  in  teaching  effectiveness 
who  score  below  the  median  on  both  of  the  leader  behavior 
dimensions . 

To  select  for  each  judge  the  upper  and  lower  groups 
in  effectiveness,  the  cut  off  point  was  determined  by  examin¬ 
ing  the  effectiveness  ratings  to  try  and  choose  approximately 
the  same  number  of  teachers  for  both  groups.  However,  for 
some  judges  there  was  a  large  number  of  ties  and  this  meant 
that  some  effectiveness  groups  were  very  unequal. 
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The  most  effective  teachers  were  then  categorized 
according  to  their  scores  on  the  behavior  dimensions.  If 
their  scores  on  both  dimensions  were  above  the  median,  they 
were  placed  in  one  category,  and  if  their  scores  on  both 
dimensions  were  below  the  median,  they  were  placed  in  a 
second  category.  The  same  categorization  was  carried  out 
for  the  group  of  ineffective  teachers.  The  results  are 
summarized  for  each  judge  in  Table  X.  The  number  of  teachers 
who  did  not  fall  into  either  of  these  categories  (above  the 
median  on  both  dimensions  or  below  the  median  on  both  dimen¬ 
sions)  may  be  determined  from  the  difference  between  the 
total  in  the  group  and  the  total  in  the  two  categories.  For 
example,  in  the  case  of  judgments  by  students,  14  teachers 
were  in  the  most  effective  group  and  13  of  these  scored 
either  above  the  median  on  both  dimensions  or  below  the 
median  on  both  dimensions.  Thus,  only  1  of  these  most 
effective  teachers  was  above  the  median  on  one  dimension 
while  below  the  median  on  the  other  dimension. 

Fisher’s  Method  for  Exact  Probability  (l,  p.  96)  was 
applied  to  the  data  and  Table  X  lists  the  probabilities  that 
such  distributions  would  happen  by  chance. 

The  results  indicated  that  there  was  a  significant 
relationship  between  extremes  in  effectiveness  ratings  and 
behavior  scores,  except  for  the  judgments  by  teacners 
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themselves.  The  probabilities  that  such  distributions  would 


happen  by  chance  were  extremely  small.  Observation  of  the 
significant  relationships  showed  that  there  was  not  one  case 
of  a  teacher  being  in  the  lowest  effectiveness  group  and 
scoring  above  the  median  on  both  dimensions.  Most  teachers 

who  were  judged  as  being  ineffective  exhibited  behavior  which 
was  described  as  below  the  median  on  both  dimensions.  Nearly 
all  the  teachers  who  were  judged  as  being  effective  exhibited 

behavior  which  was  described  as  above  the  median  on  both 
dimensions  of  leadership.  The  first  hypothesis  was  sub¬ 
stantiated  at  this  stage  of  the  analysis. 

A  further  analysis  of  the  same  data  was  carried  out 

by  utilizing  behavior  scores  and  effectiveness  ratings  pro¬ 
vided  by  different  judges.  The  purpose  of  this  procedure  was 
to  eliminate  the  possibility  of  contamination  as  a  factor 
causing  the  relationship  and  to  provide  a  further  test  of  the 
first  hypothesis.  Contamination  meant  that  a  judge  would  tend 

to  answer  both  instruments  in  the  same  manner.  That  is  the 
relationship  between  effectiveness  groups  and  behavior  des¬ 
criptions  might  have  been  caused  by  this  contamination  factor 
rather  than  being  an  indication  of  a  true  relationship.  When 

separate  judges  for  each  instrument  were  used,  contamination 
was  eliminated.  Table  XI  summarized  the  results  when  the 
behavior  descriptions  were  provided  by  students  and  effective¬ 
ness  rating  by  judges  other  than  students.  Table  XII  summarized 
the  results  when  tne  behavior  descriptions  were  provided  by 
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other  teachers*  Table  XIII  when  behavior  descriptions  were 
provided  by  principals*  Table  XIV  when  behavior  descriptions 
were  provided  by  superintendents*  and  Table  XV  when  behavior 
descriptions  were  provided  by  the  teachers  themselves. 

The  result  of  the  analysis  of  this  rearrangement  of 
the  data  was  that  behavior  descriptions  provided  by  other 
teachers  (Table  XII)  did  not  appear  to  be  significantly 
related  to  the  most  effective  and  ineffective  ratings  as 
judged  by  the  other  sets  of  raters.  The  only  exception  to 
this  was  for  effectiveness  ratings  provided  by  principals. 

For  students  and  superintendents  the  distributions 
were  significant  except  for  the  ones  involving  teachers  them¬ 
selves*  but  the  probabilities  that  such  distributions  would 
happen  by  chance  were  increased  over  the  probabilities  prior 
to  the  rearrangement. 

For  principals  the  distributions  were  significant 
when  superintendents  and  other  teachers  provided  the  ratings 
of  effectiveness.  The  probabilities  that  the  distributions 
would  happen  by  chance  were  increased  from  the  probabilities 
indicated  in  Table  X.  The  principals'  descriptions  resulted 
in  the  strongest  probabilities  that  the  distributions  would 
not  happen  by  chance. 

Table  XVI  summarized  the  results  for  the  half-groups 
of  students.  One  half-group  of  students  provided  the  ratings 
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from  which  the  most  effective  and  least  effective  teachers 
were  selected.  The  other  half-group  of  students  provided 
the  behavior  descriptions.  Although  both  behavior  descrip¬ 
tions  and  effectiveness  ratings  were  provided  by  students, 
they  were  not  the  same  group  of  students.  This  method  of 
analysing  the  data  also  eliminated  contamination.  The 
probability  was  significant,  but  increased,  that  such  a 
distribution  would  happen  by  chance. 

The  results  of  the  analysis  of  the  rearrangement  of 
the  data  confirmed  the  conclusion  that  the  first  hypothesis 
was  substantiated. 

Rank-order  Correlations 

Rank-order  correlations  were  calculated  between 
effectiveness  ratings  and  behavior  descriptions  using  the 
formula  for  tied  ranks.  Table  XVII  summarized  the  corre¬ 
lations  when  the  effectiveness  ratings  and  the  scores  on 
each  behavior  dimension  were  provided  by  the  same  set  of 
judges. 

All  the  correlations,  except  that  between  the 
teachers'  self-ratings  and  self-descriptions  of  Initiating 
Structure,  were  significant.  There  were  significant 
relationships  between  teaching  effectiveness  and  descriptions 
of  both  leader  behavior  dimensions  for  all  judges  except  the 


teachers  themselves. 
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Between  the  effectiveness  ratings  and  the  Initiating 
Structure  scores  the  greatest  amount  of  significant  relation¬ 
ship  was  provided  by  the  principals,  and  the  least  amount  of 
significant  relationship  was  provided  by  the  students. 

Between  the  effectiveness  ratings  and  the  Consideration  scores 
the  greatest  amount  of  significant  relationship  was  again 
provided  by  the  principals,  but  it  was  equal  to  the  relation¬ 
ship  shown  by  the  students.  The  least  amount  of  significant 
relationship  was  provided  by  the  teachers  themselves. 

The  second  hypothesis  stated  that.  Students  tend  to 
relate  effectiveness  of  teaching  with  teachers’  scores  on 
both  Consideration  and  Initiating  Structure,  but  Consideration 
is  more  closely  related. 

It  was  concluded  that  students  did  relate  effective¬ 
ness  to  both  behavior  dimensions  as  shown  by  the  significant 
correlations  in  Table  XVII.  To  check  whether  Consideration 
was  more  closely  related  to  effectiveness  ratings  than  was 
Initiating  Structure,  the  significance  of  the  difference  of 
the  correlations  was  tested  by  making  the  z>  transformation. 
The  same  criticism  as  was  made  previously,  when  a  parametric 
test  was  used,  applied  here.  Although  the  difference  appeared 
quite  large  and  in  the  hypothesized  direction,  it  was  not 
significant  at  the  .05  level. 
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A  further  check  of  this  hypothesis  was  provided 
through  an  analysis  of  the  data  pertaining  to  the  half¬ 
groups  of  students.  One  half-group  was  used  to  provide 
effectiveness  ratings  and  the  other  half-group  was  used 
to  provide  behavior  descriptions.  The  correlations  were 
.42  between  Initiating  Structure  and  the  effectiveness 
ratings  and  .47  between  Consideration  and  the  effective¬ 
ness  ratings.  Both  correlations  were  significant  at  the 
.01  level,  but  the  difference  between  them  was  not  signi¬ 
ficant.  This  agreed  with  the  conclusion  in  the  above  para¬ 
graph —  for  students,  effectiveness  was  related  to  both 
dimensions,  but  not  more  closely  related  to  Consideration. 

A  third  check  of  this  hypothesis  was  provided  by 
correlations  between  effectiveness  ratings  by  students  and 
behavior  descriptions  by  judges  other  than  students.  When 
behavior  descriptions  were  provided  by  principals  and  the 
effectiveness  ratings  by  the  students,  only  the  correlation 
involving  Consideration  was  significantly  related  as  may 
be  noted  from  Table  XVIII.  There  was,  though,  a  significant 
difference  at  the  .05  level  between  the  correlations.  Only 
one  dimension  was  significantly  related  to  effectiveness 
ratings,  and  therefore.  Consideration  was  more  closely 
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TABLE  XVIII 

CORRELATIONS  BETWEEN  EFFECTIVENESS  RATINGS  AND 
BEHAVIOR  DESCRIPTIONS  BY  SEPARATE  JUDGES 


Behavior  descriptions 
provided  hy: 

Students 

Ratings  provided  by: 
Principals  Superintendents 

Students- 

I. 

s. 

.40** 

.33** 

— 

C. 

.45** 

.34** 

Principals- 

I. 

s. 

.25 

— 

.49** 

- 

C. 

.55** 

.48** 

Superintendents- 

I. 

s. 

.30* 

.53** 

— 

c. 

.57** 

.49** 

NOTE:  *  significant  at  the  .05  level. 

**  significant  at  the  .01  level. 
I.  S.-  Initiating  Structure. 

C.  -  Consideration. 
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When  behavior  descriptions  were  provided  by  superin¬ 
tendents,  both  dimensions  were  significantly  related  to 
effectiveness  ratings  by  students.  The  difference  between 
them  was  significant  at  the  .05  level  in  the  hypothesized 
direction.,  indicating  that  Consideration  was  more  closely 
related  to  effectiveness  ratings  by  students. 

The  third  hypothesis  stated  that.  Superiors  (principals) 
and  superintendents)  tend  to  relate  effectiveness  of  teaching 
with  teachers’  scores  on  both  Consideration  and  Initiating 
Structure,  but  Initiating  Structure  is  more  closely  related. 

As  observed  in  Table  XVII,  when  the  principals  provided 
the  effectiveness  ratings  and  the  descriptions,  both  dimensions 
were  significantly  related  to  effectiveness  ratings,  but  there 
was  no  significant  difference.  This  same  result  is  observed 
in  Table  XVIII  when  the  superintendents  and  students  provided 
the  behavior  descriptions  and  the  principals  the  effective¬ 
ness  ratings.  Thus,  for  principals,  both  dimensions  of  teacher 
leader  behavior  were  related  to  effectiveness  ratings,  but 
Initiating  Structure  was  not  more  closely  related. 

As  observed  in  Table  XVII,  when  the  superintendents 
provided  the  effectiveness  ratings  and  the  descriptions,  both 
dimensions  were  significantly  related  to  effectiveness  ratings, 
but  there  was  no  significant  difference  between  them.  This 
same  result  is  observed  in  Table  XVIII  when  the  students  and 
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principals  provided  the  behavior  descriptions  and  the  super¬ 
intendents  the  effectiveness  ratings.  Thus,  superintendents 
related  teaching  effectiveness  to  both  dimensions  of  teacher 
leader  behavior,  but  Initiating  Structure  was  not  more  closely 
related. 

Other  teachers  significantly  related  teaching  effective¬ 
ness  to  behavior  on  both  dimensions  and  there  was  no  signifi¬ 
cant  difference  in  the  correlations.  Teachers  themselves 
related  teaching  effectiveness  only  to  Consideration.  These 
results  may  be  observed  in  Table  XVII. 

It  should  be  noted  that  all  the  significant  correlations 
discussed  in  this  section  indicated  partial  relationship  and 
not  perfect  relationship. 

The  correlations  might  be  a  result  of  the  contamination 
factor  explained  previously.  The  relationships  might  be  due 
to  judges  responding  to  both  instruments  in  a  like  manner. 

This  factor  would  be  eliminated  when  separate  judges  were  used 
to  describe  and  rate.  In  Table  XVIII  the  results  indicated 
that  there  was  still  a  relationship  when  contamination  had  been 
eliminated,  although  the  relationships  were  lower.  Contamina¬ 
tion  was  also  eliminated  when  the  students  were  split  into 
two  half-groups.  The  correlation  was  .42  between  Initiating 
Structure  and  the  effectiveness  ratings  for  the  half-groups 
of  students,  and  .57  between  Initiating  Structure  and 
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effectiveness  ratings  for  the  whole  group  of  students.  The 
difference  between  them  was  not  significant.  Between  Con¬ 
sideration  and  the  effectiveness  ratings  the  correlations  were 
.47  for  the  half-groups  of  students  and  .70  for  the  whole  group 
of  students.  The  difference  between  them  was  significant  at 
the  .05  level.  Contamination  apparently  was  a  factor  with 
Consideration,  not  with  Initiating  Structure,  but  even  with  it 
eliminated  the  correlations  were  significant. 

II.  DISCUSSION 

This  chapter  reported  and  analysed  the  data  used  to 
test  the  three  hypotheses.  The  first  hypothesis  stated  that 
those  teachers  who  were  most  effective  would  be  above  the 
median  on  both  dimensions  and  those  teachers  who  were  least 
effective  would  be  below  the  median  on  both  dimensions.  This 
was  supported.  All  judges,  except  the  teachers  themselves, 
confirmed  this  result.  When  separate  judges  were  utilized 
for  behavior  descriptions  and  effectiveness  ratings,  the 
result  was  still  supported  for  behavior  descriptions  by 
students,  principals,  and  superintendents  although  the  proba¬ 
bilities  that  such  distributions  would  happen  by  chance  were 
increased. 

The  method  of  analysis  that  was  used  allowed  a  tenta¬ 
tive  conclusion  that  the  extent  of  teacher  leader  behavior 
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was  related  to  effective  or  Ineffective  teaching.  When 
leader  behavior  was  thought  to  be  present  to  a  greater  than 
average  extent.,  the  teachers  were  effective;  when  leader 
behavior  was  thought  to  be  present  to  a  less  than  average 
extent ,  the  teachers  were  ineffective. 

The  second  hypothesis  stated  that  students  would 
relate  effectiveness  ratings  to  behavior  scores  on  both 
dimensions,  but  Consideration  would  be  more  closely  related. 
For  behavior  descriptions  and  effectiveness  ratings  by  the 
students,  both  dimensions  were  related  and  there  was  a  differ¬ 
ence  (but  not  significant)  in  the  hypothesized  direction. 

For  the  half -groups  of  students,  both  dimensions  were  related 
to  effectiveness  ratings  and  there  was  no  significant  differ¬ 
ence  between  them.  When  effectiveness  ratings  provided  by 
the  students  were  correlated  with  descriptions  provided  by 
the  superintendents,  both  dimensions  were  significantly 
related  and  Consideration  was  more  closely  related.  In  the 
case  of  behavior  descriptions  provided  by  the  principals,  only 
that  between  Consideration  and  the  effectiveness  ratings  was 
significant.  The  general  conclusion  was  that  students  related 
effectiveness  to  scores  on  both  dimensions  and  there  was  a 
slight  tendency  to  relate  their  ratings  of  effectiveness  more 
closely  to  Consideration. 

The  final  hypothesis  stated  that  both  behavior 
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dimensions  would  be  related  to  effectiveness  ratings  by 
principals  and  superintendents *  but  that  Initiating  Structure 
would  be  more  closely  related.  For  both  principals  and  super¬ 
intendents  there  was  a  significant  relationship  between  their 
effectiveness  ratings  and  behavior  descriptions  provided  by 
themselves*  students  and  the  other  administrator*  but  Initiat¬ 
ing  Structure  was  not  more  closely  related.  This  latter  con¬ 
clusion  was  unexpected  as  it  was  believed  that  administrators 
preferred  the  teacher  who  got  things  done.  In  this  study 
administrators  considered  the  teacher’s  human  relations  behavior 
in  the  classroom  as  important  a  part  of  effective  teaching  as 
his  Initiating  Structure  behavior. 

The  use  of  separate  judges  for  behavior  descriptions  and 
effectiveness  ratings  eliminated  the  contamination  factor.  When 
contamination  was  eliminated*  the  relationships  were  decreased* 
but  remained  significant.  For  the  students1  half -groups*  con¬ 
tamination  significantly  reduced  the  correlation  between  Con¬ 
sideration  and  effectiveness  ratings*  but  not  the  correlation 
between  Initiating  Structure  and  effectiveness  ratings.  In  the 
case  of  students  the  tendency  for  judges  to  respond  to  both 
instruments  in  the  same  manner  appeared  to  be  a  more  important 
factor  between  Consideration  and  effectiveness  than  it  did 
between  Initiating  Structure  and  effectiveness.  It  might  be 
that  general  "likes”  and  "dislikes"*  which  undoubtedly  affect 
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a  judge’s  response  on  the  global  rating,  have  a  greater 
effect  on  his  perception  of  a  teacher’s  human  relations 
behavior  than  the  effect  they  have  on  his  perception  of 
a  teacher's  Initiating  Structure  behavior. 
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CHAPTER  VIII 


CONCLUSIONS ,  IMPLICATIONS  AND  RECOMMENDATIONS 

An  attempt  has  been  made  to  study  elements  of  behavior 
common  to  teachers.  Empirical  research  in  other  fields  has 
suggested  that  dimensions  of  leader  behavior  might  be  appli¬ 
cable  to  teacher-student  relationships.  The  relationships 
between  these  dimensions  of  Consideration  and  Initiating 
Structure,  and  ratings  of  teaching  effectiveness  have  been 
investigated  for  fifty-nine  teachers  in  twelve  schools  in  the 
Province  of  Alberta.  Five  viewpoints  or  frames  of  reference 
have  been  utilized  to  provide  global  effectiveness  ratings 
and  descriptions  of  teacher  leader  behavior;  they  were  students, 
other  teachers  on  the  same  staff,  principals,  superintendents, 
and  the  teachers  themselves. 

The  only  evidence  provided  that  the  conclusions  could 
be  generalized  beyond  the  persons  in  the  study  was  the  wide 
variation  of  experience,  training,  grades  and  subjects  taught, 
background,  and  schools  of  the  fifty-nine  teachers.  No  urban 
schools  or  elementary  teachers  were  used  in  the  core  of  the 
study. 

All  the  participating  teachers  volunteered.  Thus,  many 
schools  were  not  included  because  one  or  more  of  the  members 
of  the  staff  did  not  wish  to  take  part.  The  study  was  conducted 
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under  the  condition  that  the  results  were  not  to  be  used  for 
any  administrative  purposes  in  the  schools  in  which  the  data 
were  collected.  Complete  anonymity  of  the  participants  was 
maintained. 

The  Teacher  Leader  Behavior  Description  Questionnaire 
was  used  to  provide  descriptions  of  teacher  leader  behavior. 
A  global  rating  scale  provided  the  ratings  of  teaching 
effectiveness. 


I.  CONCLUSIONS 

1.  The  most  important  conclusion  was  that  part  of 
teaching  effectiveness  appeared  to  be  related  to  the  extent 
of  a  teacher’s  Consideration  and  Initiating  Structure 
behavior.  The  most  effective  teachers  exhibited  behavior 
that  was  described  as  being  above  the  median  on  both  dimen¬ 
sions  while  the  least  effective  teachers  exhibited  behavior 
that  was  described  as  being  below  the  median  on  both  dimen¬ 
sions  . 


2.  Students  appeared  to  relate  teaching  effective¬ 
ness  to  the  teacher's  leader  behavior  and  there  was  some 
indication  that  they  believed  human  relations  behavior 
(Consideration)  was  more  closely  related  to  effectiveness 
than  was  goal  accomplishment  behavior  (initiating  Structure). 
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3.  Principals  and  superintendents  related  teaching 
effectiveness  to  the  teacher's  leader  behavior.  Neither  of 
these  administrators  considered  that  the  teacher's  behavior 
in  trying  to  attain  goals  was  any  more  closely  related  to 
effectiveness  than  was  the  teacher's  behavior  in  human 
relations . 

4.  The  teachers  themselves  as  judges  were  in  a  category 
distinct  from  all  the  other  judges.  They  showed  very  little, 
if  any,  agreement  with  other  judges  on  ratings  of  effective¬ 
ness  or  on  behavior  descriptions.  They  did  not  show  any 
relationship  between  most  effective  and  most  ineffective  ratings 
and  behavior  descriptions.  It  was  suggested  that  spurious 
factors  affected  their  judgments  to  a  greater  extent  than  it 
affected  judgments  by  others. 

5.  Since  the  amount  of  agreement  on  both  behavior 
dimensions  was  not  greatly  different  from  the  amount  of  agree¬ 
ment  on  effectiveness  ratings,  it  was  concluded  that  it  was 
just  as  difficult  to  agree  on  certain  aspects  of  what  the 
teacher  does  in  the  classroom  as  it  was  to  agree  on  how  well 
he  teaches  in  general. 

6.  The  principals  and  superintendents  probably  have 
had  the  most  experience  in  evaluating  teachers  whether  it  be 
formal  or  informal  evaluation.  They  probably  have  had  the 
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most  supervisory  experience  with  teachers  where  attempts  have 
been  made  to  understand  and  help  Individual  teachers  to  be 
more  effective.  Thus,  they  appeared  to  be  in  a  more  similar 
position  than  any  other  pair  of  judges  with  regard  to  under¬ 
standing  teaching  effectiveness  and  they  showed  the  most  agree¬ 
ment  on  the  ratings  of  effectiveness. 

The  superintendents  and  students  showed  the  least 
agreement  on  the  effectiveness  ratings  (excluding  self- 
ratings)  .  They  were  likely  in  the  most  dissimilar  position 
for  developing  a  frame  of  reference  to  judge  the  effective¬ 
ness  of  a  teacher. 

7.  The  principals  and  other  teachers  showed  the 
greatest  amount  of  agreement  on  both  behavior  dimensions. 

They  were  probably  in  the  most  similar  position  when  it  came 

to  describing  the  teacher's  everyday  behavior  in  the  classroom. 

8.  It  was  concluded  that  an  administrator  might  be 
able  to  infer  a  teacher's  human  relations  behavior  in  the 
classroom  from  personal  contact  with  him  outside  of  the  class¬ 
room.  It  was  not  considered  as  possible  that  an  administrator 
would  be  able  to  infer  a  teacher's  classroom  Initiating 
Structure  behavior  from  personal  contact  with  the  teacher  out¬ 
side  of  the  classroom. 
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9.  The  students  were  the  only  judges  who  were 
constantly  in  contact  with  the  teacher  and  they  were  the 
only  judges  who  showed  some  significant  agreement  on 
descriptions  of  behavior  with  all  other  judges  including 
the  teachers  themselves. 

10.  Conclusions  regarding  the  use  of  the  TLBDQ  as  an 
instrument  to  describe  teacher  leader  behavior  were  indefinite. 
The  reliability  of  the  Initiating  Structure  dimension  was  not 
considered  satisfactorily  high,,  and  the  independence  of  the 
two  dimensions  was  lower  than  desirable.  One  must  conclude 
that  the  present  TLBDQ  should  not  be  used  by  any  one  set  of 
judges  and  that  all  five  viewpoints  used  in  this  study  should 
be  considered. 

II.  IMPLICATIONS 

The  results  of  this  research  may  be  useful  to  those 
persons  concerned  with  the  improvement  of  the  most  critical 
aspect  of  the  educational  system,  the  instructional  personnel  . 

1.  Administrators  will  find  it  valuable  to  know  that 
there  was  some  relationship  between  a  teacher’s  effectiveness 
in  the  classroom  and  his  leader  behavior.  Of  special  signiii- 
cance  was  the  conclusion  that  the  most  effective  teachers  were 
above  the  median  in  both  Consideration  and  Initiating  Structure 
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and.  that  Ineffective  teachers  were  below  the  median  on  both 
dimensions.  The  emphasis  on  one  dimension  to  the  neglect  of 
the  other  might  result  in  an  unsatisfactory  solution  to  a 
particular  problem.  An  understanding  and  study  of  leader 
behavior  will  afford  a  means  whereby  new  and  ineffective 
teachers  might  be  helped  without  the  use  of  any  instrument. 
The  study  of  leader  behavior  might  prove  valuable  for  in- 
service  education  programs. 

2.  There  was  evidence  in  this  research  that  teacher 
leader  behavior,  as  studied  through  the  TLBDQ,  did  not 
account  entirely  for  teacher  effectiveness  or  ineffective¬ 
ness.  Firstly,  behavior  of  the  teacher  which  is  a  product 
of  his  technical  knowledge  such  as  skill  in  subject  matter 
might  be  a  factor.  This  would  vary  from  grade  to  grade  and 
subject  to  subject.  Secondly,  effective  teacher  behavior 
might  be  a  function  of  the  situational  variables,  such  as 
adaptation  to  a  class  of  slow  learners. 

If  the  above  is  true,  then  administrators  must 
realize  that  leader  behavior,  behavior  applicable  to  the 
situation,  and  behavior  reflecting  a  knowledge  of  subject 
matter  must  all  be  considered  in  a  complete  analysis  of 
effectiveness.  Reliance  on  one  or  two  of  the  factors  to 
the  neglect  of  the  other  will  result  in  an  incomplete 
analysis  of  a  problem. 
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3.  Administrators  need  to  broaden  their  frame  of 
reference  for  perceiving  teachers.  They  need  especially  to 
take  into  account  the  student* s  relationships  with  the 
teacher. 

Superintendents  could  use  the  principals ' 
descriptions  of  what  the  teachers  do,  to  supplement  their 
own,  since  their  descriptions  were  at  variance  with  those 
of  the  principals. 

Superintendents  and  principals  need  to  obtain  a 
more  complete  picture  of  how  the  teacher  provides  and  main¬ 
tains  relationships  to  get  things  done  and  to  accomplish 
goals. 

With  regard  to  obtaining  a  broad  picture  of  the 
teacher’s  leader  behavior  in  the  classroom,  administrators 
can  provide  no  more  agreement  among  themselves,  or  with 
students,  than  their  agreement  with  those  other  teachers 
who  have  had  no  opportunity  to  observe  the  teacher's  class¬ 
room  behavior.  This  emphasizes  a  fact,  about  which  most 
administrators  are  well  aware,  that  their  visitations  just 
scratch  the  surface  of  obtaining  a  complete  picture  of  the 
teacher's  performance. 

4.  Administrators  need  to  communicate  with  teachers 
the  perception  that  they  have  of  the  teacher's  effective¬ 
ness  and  behavior.  Also,  they  need  to  obtain  the  teacher's 
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perception  of  himself. 

5.  If  leader  behavior  is  stable,  the  results  may 
have  implications  for  teacher  selection  prior  to  enrolling 
in  a  teachers’  college.  If  leader  behavior  is  not  stable 
and  changes  can  be  produced,  then  the  results  may  have  impli¬ 
cations  for  courses  on  a  teacher  education  program. 

6.  There  are  also  implications  from  this  research 
for  the  selection  of  administrators.  Other  things  being 
equal,  the  most  effective  teacher  could  be  educated  to  be 
the  most  effective  administrator  as  he  would  have  exhibited 
behaviors  necessary  for  successful  leadership. 

Ill .  RECOMMENDATIONS 

1.  Study  is  recommended  to  enlarge  the  understanding 
of  leader  behavior  and  specifically  understanding  of  the  two 
dimensions.  More  complete  knowledge  is  needed  of  what  other 
behavior  items,  aside  from  specific  items  on  the  TLBDQ,  can 
be  classified  as  Consideration  or  Initiating  Structure.  What 
behavior  definitely  cannot  be  classified  under  these  two 
dimensions?  How  does  an  individual  develop  these  behavior 
patterns  and  is  it  possible  to  make  any  significant  change 


in  them? 
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2.  The  conclusion  that  teacher  leader  behavior  is 
to  some  extent  related  to  teaching  effectiveness  needs  to 

be  confirmed  in  other  situations  especially  in  urban  centres 
and  with  elementary  teachers. 

3.  To  make  the  TLBDQ  a  more  effective  instrument  a 
further  check  should  be  conducted  of  the  reliability  of  the 
dimensions.  An  item  analysis  should  be  carried  out  to 
eliminate  items  which  do  not  show  a  relationship  to  their 
respective  dimensions ,  and  to  reduce  the  dependence  between 
the  two  dimensions. 

An  analysis  of  the  use  of  junior  high  school 
students  compared  to  senior  high  school  students  as  respond¬ 
ents  might  be  worthwhile. 

4.  A  study  is  recommended  of  the  relation  between 
Consideration  and  Initiating  Structure  and  the  productivity 
of  the  pupils,  and  also  the  relation  between  Consideration 
and  Initiating  Structure  and  the  teacher  predictor  variables 
such  as  education,  knowledge,  and  intelligence. 

5.  In  the  field  of  teacher  effectiveness  some  study 
is  recommended  of  the  relationship  of  the  situational  vari¬ 
ables  to  teacher  behavior  and  the  relationship  of  the 
situational  variables  to  pupil  productivity. 
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APPENDIX  A 


QUESTIONNAIRES  AND  DIRECTIONS 

TEACHER  LEADER  BEHAVIOR  DESCRIPTION  QUESTIONNAIRE 

On  the  following  pages  is  a  list  of  items  that  may 
he  used  to  describe  the  behavior  of  teachers  in  their  work 
with  students.  Each  item  describes  a  specific  kind  of 
behavior,  but  does  not  ask  you  to  judge  whether  the  behavior 
is  desirable  or  undesirable.  This  is  not  a  test  of  ability. 

It  simply  asks  you  to  describe,  as  accurately  as  you  can, 
the  behavior  of  a  certain  teacher. 

DIRECTIONS: 

a.  READ  each  item  carefully.  Read  "she”  for  "he"  if  the 
teacher  being  described  is  female. 

b.  THINK  about  how  frequently  the  teacher  engages  in  the 
behavior  described  by  the  item. 

c.  DECIDE  whether  he  always,  often,  occasionally,  seldom, 
or  never  acts  as  described  by  the  item. 

d.  DRAW  A  CIRCLE  around  one  of  the  five  letters  following 
the  item  to  show  the  answer  you  have  selected  using  the 
following: 

A  -  Always,  B  -  Often,  C  -  Occasionally,  D  -  Seldom,  E  -  Never. 
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1.  He  helps  students  with  their  personal  problems.  ABODE 

2.  He  makes  his  attitudes  clear  to  the  class.  ABODE 

3.  He  does  little  things  to  make  it  pleasant  to  be 

a  member  of  the  class.  ABODE 

4.  He  tries  out  his  new  ideas  with  the  class.  ABODE 

5.  He  is  easy  to  understand.  ABODE 

6.  He  rules  with  an  iron  hand.  ABODE 

7.  He  finds  time  to  listen  to  students.  ABODE 

8.  He  criticizes  poor  work.  ABODE 

9.  He  speaks  in  a  manner  not  to  be  questioned.  ABODE 

10.  He  talks  with  students  outside  of  class.  ABODE 

11.  He  looks  out  for  the  personal  welfare  of 

individual  class  members.  ABODE 

12.  He  assigns  class  members  to  particular  tasks.  ABODE 

13.  He  schedules  the  work  to  be  done.  ABODE 


14.  He  maintains  definite  standards  of  performance.  ABODE 

15.  He  gives  students  reasons  or  explanations  for 


his  actions.  ABODE 

16.  He  acts  without  consulting  the  class.  ABODE 

17.  He  praises  students.  ABODE 

18.  He  emphasizes  the  meeting  of  deadlines.  ABODE 

19.  He  treats  students  fairly.  ABODE 

20.  He  encourages  the  use  of  uniform  procedures.  ABODE 

21.  He  is  willing  to  make  changes.  ABODE 
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22.  He  makes  sure  that  his  job  as  teacher  is  ABCDE 

understood  by  all  class  members. 

23.  He  is  friendly  and  approachable.  ABCDE 

24.  He  asks  that  students  follow  standard  rules 

and  regulations.  .  ABCDE 

25.  He  makes  students  feel  at  ease  when  talking 

with  him.  ABCDE 

26.  He  lets  class  members  know  what  is  expected 

of  them.  ABCDE 

27.  He  puts  suggestions  made  by  the  class  into 

operation.  ABCDE 

28.  He  sees  to  it  that  class  members  are  working 

up  to  capacity.  ABCDE 

29.  He  gets  students  to  help  in  the  planning  of 

class  activities.  ABCDE 

30.  He  sees  to  it  that  the  work  of  the  class  is 

organized.  ABCDE 


Please  make  sure  that  you  have  answered  every  question. 

NOTE:  Item  number  16  is  scored  negatively. 

The  Consideration  items  are  numbers:  1,  3*  5* 
10,  11,  15,  16,  17,  19,  21,  23,  25,  27  and  29. 

The  Initiating  Structure  items  are  numbers:  2, 
h,  6,  8,  9,  12,  13,  14,  18,  20,  22,  24,  26,  28  and  30. 
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GLOBAL  RATING  OP  TEACHER  EFFECTIVENESS 

Please  check  the  statement  which  in  your  judgment 
best  indicates  how  well  this  teacher  does  in  teaching  his 
students. 

PLEASE  CHECK  IN  ONE  SQUARE. 

1.  Outstanding  for  this  school. 

2.  High  for  this  school. 

3.  Slightly  better  than  average  for  this  school. 

4.  Slightly  below  average  for  this  school. 

5.  Low  for  this  school. 

6.  Very  low  for  this  school. 

DIRECTIONS  FOR  ADMINISTERING  INSTRUMENTS  TO  STUDENTS 

1.  This  is  a  research  project  which  is  studying  teaching. 

You  will  be  asked  to  answer  a  questionnaire  which  will  be 
distributed  later. 

2.  This  study  will  not  mention  names  of  school  *  teachers  or 
students . 

3.  There  was  no  particular  reason  why  this  school  or  teacher 
or  you  as  students  were  chosen.  It  was  all  done  by  chance. 

4.  The  study  is  not  concerned  with  individuals.  The  results 
will  be  about  a  number  of  teachers  in  the  province,  not  those 
in  a  particular  school  or  division. 
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5.  No  one  will  see  the  answered  questionnaires,  but  the 
investigator. 

6.  Do  not  put  your  name  on  the  questionnaire. 

7.  Write  the  teacher’s  name  on  the  slip  attached  at  the  top. 
This  will  be  torn  off  after  you  have  finished  and  a  number 
has  been  placed  on  the  questionnaire. 

8.  If  you  are  in  grades  eight  or  nine  place  a  "J"  on  the 
front  of  your  paper. 

9.  Read  the  printed  instructions  on  the  questionnaire  to 
the  students. 

DIRECTIONS  FOR  ADMINISTERING  INSTRUMENTS  TO 
SUPERINTENDENTS,  TEACHERS  THEMSELVES, 

PRINCIPALS  AND  OTHER  TEACHERS 

1.  Write  the  name  of  the  teacher  being  described  on  the  slip 
which  is  attached.  This  will  be  removed  after  you  have 
finished  and  a  number  has  been  placed  on  the  questionnaire  by 
the  investigator. 

2.  Do  not  write  your  name  on  the  questionnaire. 

3.  Answer  the  questions  about  the  teacher  to  the  best  of 
your  knowledge.  You  may  have  to  infer  what  the  teacher  does 
in  the  classroom  from  your  experience  with  him  or  her. 

4.  Answer  the  questions  individually  without  referring  to 
the  other  questions. 
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5.  On  the  back  page  would  you  please  rate  his  or  her 
teaching  of  students  using  as  a  standard  the  teachers  of 
this  school.  In  your  rating,  fields  of  teacher  activity 
outside  of  teaching  students  should  not  be  included. 
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APPENDIX  B 


LETTER  TO  SUPERINTENDENTS 

The  following  Is  a  copy  of  the  letter  sent  to  pro- 
vincially  appointed  superintendents  of  schools  in  school 
divisions  and  counties  in  the  Province  of  Alberta.  This 
letter  was  the  means  used  to  obtain  the  co-operation  of  the 
twelve  schools  used  in  this  study. 

Dear  Sir: 

This  letter  is  to  inquire  whether  you  would  be  willing 
to  assist  in  a  research  study  which  I  am  conducting  as  a 
graduate  student  in  the  Division  of  Educational  Administration, 
University  of  Alberta. 

The  purpose  of  this  study  is  to  aid  in  the  determina¬ 
tion  of  teacher  competence  and  to  see  if  there  are  common 
patterns  of  effective  performance  of  teachers. 

The  thesis  is  designed  to  study  descriptions  of 
teachers’  classroom  performance  and  to  relate  these  descrip¬ 
tions  to  ratings  of  teacher  effectiveness.  It  is  hoped  that 
it  might  provide  a  more  thorough  background  for  study  of  such 
problems  as  teacher  education,  in-service  education  and  teacher 
selection  by  determining  specific  areas  of  performance  which 
are  related  to  effective  teaching. 


It  is  planned  to  carry  out  the  study  by  using  one 
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school  in  each  of  twelve  Divisions.  In  each  school  five 
teachers  teaching  above  grade  six  would  be  chosen  at  random 
at  the  start  of  the  study.  The  intent  is  to  have  each 
teacher’s  classroom  performance  described  by  ten  students, 
five  other  teachers,  the  principal,  the  superintendent,  and 
the  teacher  himself,  and  also  have  his  teaching  rated  by  the 
same  people.  This  would  involve  the  use  of  two  very  simple 
instruments  which  could  be  completed  in  five  minutes  and  no 
preparation  prior  to  the  administering  of  the  instruments 
would  be  required. 

To  carry  this  out,  you  would  be  asked  to  do  the 
following: 

Obtain  the  co-operation  of  the  principal,  and  the 
entire  staff  of  one  school  in  your  Division.  This  school 
should  have  a  minimum  of  ten  teachers  on  the  staff  and  at 
least  eight  should  teach  above  grade  six. 

Nothing  further  would  be  necessary  until  the  actual 
administering  of  the  instruments  by  myself  which  I  would 
hope  could  be  done  in  the  latter  part  of  April  or  the  first 
week  of  May. 

Since  some  might  be  reluctant  to  take  part  in  a  study 
such  as  this  the  following  should  be  noted: 

I  will  administer  the  instruments  myself  and  no  one 
else  will  see  or  have  access  to  them.  The  information  will 
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be  used  only  for  research. 

The  reported  study  will  involve  no  names  either  of 
individuals  *  schools  or  Divisions.  There  will  be  no  attempt 
to  compare  schools  or  Divisions  and  no  such  breakdown  will 
appear. 

A  list  of  the  names  of  the  five  persons  being 
described  with  corresponding  numbers  will  be  made  up  upon 
arrival  at  the  school  and  be  destroyed  before  leaving  each 
school.  The  instruments  will  bear  the  teacher's  number  and 
his  name  will  appear  on  an  attached  slip  to  serve  only  as 
identification,  so  that  individuals  will  be  describing  the 
same  teacher.  The  slips  then  will  be  destroyed;  all  persons 
taking  part  will  remain  anonymous. 

This  study  has  the  approval  of  the  Alberta  Teachers' 
Association.  I  have  been  a  member  of  the  provincial  execu¬ 
tive  of  the  Saskatchewan  Teachers '  Federation  and  am  very 
concerned  that  all  those  taking  part  would  do  so  willingly 
and  that  no  school  would  be  utilized  in  which  a  single  persm 
did  not  wish  to  do  so. 

I  would  appreciate  an  answer  when  you  have  been  able 
to  determine  whether  one  of  your  schools  would  co-operate, 
and  whether  you  would  be  willing.  You  could  inform  me  as 
to  the  time  which  would  be  satisfactory. 

I  have  enclosed  a  card  to  facilitate  your  reply.  I 
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would  be  only  too  willing  to  provide  additional  information 
or  to  answer  your  questions. 

Yours  sincerely, 

A.  G.  McBeath 
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